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(54) Title: METHODS FOR RAPID IDENTIFICATION OF PATHOGENS IN HUMANS AND ANIMALS 
(57) Abstract: The present invention provides methods of: identi^ng pathogens in Kological sj 



, „ c -r m humans and animals. 

resolving a plurality of eiiologic agents present in samples obtained torn humans and animals, determining detailed genetic infor- 
mation about such pathogens oretiologic agents, and rapid detection and identification of bioagents 6om environmental, clinical or 

other samples. 
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METHODS FOR RAPID IDENTIFICATION OF PATHOGENS 
IN HUMANS AND ANIMALS 

FIELD OF THE INVENTION 
5 The present invention relates generally to clinical applications of directed to the 

identification of pathogens in biological samples from humans and animals. The present 
invention is also directed to the resolution of a plurality of etiologic agents present in samples 
obtained fiwm humans and animals. The invention is further directed to the determination of 
detailed genetic information about such pathogens or etiologic agents. 

10 The identification ofibe bioagent is important for determining a proper course of 

treatment and/or eradication of the bioagent in such cases as biological warfere and natural 
infections. Furthermore, the determination of the geographic origin of a selected bioagent will 
facilitate the identification of potraitial criminal identity. The present invention also relates to 
methods for rapid detection and identification of bioagents from environmental, clinical or other 

15 samples. The mefliods provide for detection and characterization of a unique base composition 
signature (BCS) from any bioagent, mcluding bacteria and vinises. The unique BCS is used to 
rapidly identify the bioagent 

BACKGROUND OF THE INVENTION 

20 In die United States, hospitals report well over 5 million cases of recognized infectious 

disease-related illnesses annually. Significantly greater numbers remain undetected, bodx in the 
inpatient and community setting, resulting in substantial morbidity and mortality. Critical 
intervention for infectious disease relies on rapid, sensitive and specific detection of the 
offending pathogen, and is central to die mission of microbiology laboratories at medical centers. 

25 Unfortunately, despite the recognition diat outcomes fixmi infectious iUnesses are.directiy 
associated with time to pathogen recognition, as well as accurate identification of flie class and 
species of microbe, and ability to identify the presence of drug resistance isolates, conventional 
hospital laboratories often remain encumbered by traditional slow multi-step culture based 
assays. Other limitations of the conventional laboratory v*ich have become increasingly 

30 apparent include: extremely prolonged wait-times for pathogens with long generation time (up to 
several' vreeks); requirements for additional testing and wait times for speciafion 
identification of antimicrobial resistance; diminished test sensitivity for patients who have 
received antibiotics; and absolute inability to culture certain pafliogens in disease states 
associated with microbial infection. 
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For more fhan a discade, molecular testing has been heralded as the diagnostic tool for 
the new millennium, \vhose ultimate potential could include forced obsolescence of traditional 
hospital laboratories. However, despite die fact that significant advances in clinical application of 
PCR techniques have occurred, die practicing physician still relies principally on standard 
5 techniques. A brief discussion of several existing applications of PCR in the hospit^-based 
setting follows. 

Generally, speaking molecular diagnostics have been championed for identifying 
organisms that cannot be grown in vitro, or in instances where existing culture techniques are 
insensitive and/or require prolonged incubation times. PCR-based diagnostics have been 

10 successfully developed for a wide variety of microbes. Application to the clinical arena has met 
with variable success, with only a few assays achieving acceptance and utility. 

One of the earliest, and perhaps most widely recognized applications of PCR for clinical 
practice is in detection of Mycobacteritm tuberculosis. Clinical characteristics favoring 
development of a nonculture-based test for tuberculosis include week to month long delays 

1 5 associated with standard testing, occurrence of drug-resistant isolates and public health 

imperatives associated with recognition, isolation and treatment Although frequently used as a 
diagnostic adjunctive, practical and routine clinical application of PCR remams problematic due 
to significant inter-laboratory variation in sensitivity, and inadequate specificity for use in low 
prevalence populations, requiring further development at the technical level. Recent advances in 

20 the laboratory suggest that identification of drug resistant isolates by amplification of mutations 
associated witii specific antibiotic resistance (e.g., rpoB gene in lifiunpin resistant strains) may 
be forthcoming for clinical use, although widespread application will require extensive clinical 
validatioiL 

One diagnostic assay, ^diich has gained widespread acceptance, is for C. trachomatis. 

25 Conventional detection systems are luniting due to inadequate sensitivity and specificity (direct 
immunofluorescence or enzyme inmiunoassay) or die requirement for specialized culture 
faciUties, due to the &stidious characteristics of this microbe. Laboratory development, followed 
by widespread cUnical validation testing in a variety of acute and nonacute caie settings have 
demonsti-ated excellent sensitivity (90-100%) and specificity (97%) of the PCR assay leading to 

30 its commercial development Proven efficacy of the PCR assay fixmi bofli genital and urine 
sampling, have residted in its application to a variety of clinical setting, most recentiy induding 
routine screening of patients considered at risk. 

While the full potential for PCR diagnostics to provide rapid and critical information to 
physicians faced with difficult clinical-decisions has yet to be realized, one recentiy developed 
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assay provides an example of the promise of this evolving technology. Distinguishing life- 
threatening causes of fever from more benign causes in children is a fundamental clinical 
dilemma faced by clmicians, particularly when infections of the central nervous system are being 
considered. Bacterial causes of meningitis can be highly aggressive, but generally cannot be 
5 dififerentia'ted on a clinical basis from ascitic meningitis, which is a relatively benign condition 
&at can be mans^d on an ou^atient basis. Existing blood culture methods often take several 
days to turn positive, and are often confounded by poor sensitivity or &lse-nfigative findings in 
patients receivuig empiric antimicrobials. Testing and application of a PGR assay for enterovnal 
menmgitis has been found to be highly sensitive. Wi& reporting of results within 1 day, 

10 preliminary clinical trials have shown significant reductions in hospital costs, due to decreased 
duration of hospital stays and reduction in antibiotic therapy. Other vkal PGR assays, now 
routinely available include those for herpes amplex virus, cytomegalovirus, hepatitis and HIV. 
Each has a demonstrated cost savings role in clinical practice, including detection of o^rwise 
difficult to diagnose infections and newly realized capacity to monitor progression of disease and 

IS response to therapy, vital in the management of chronic infectious diseases. 

The concept of a universal detection system has been forwarded for identification of 
bacterial pathogens, and speaks most directly to the possible clinical implications of a broad- 
based screening tool for clinical use. Exploiting the existence of highly conserved regions of 
DNA common to all bacterial species in a PGR assay would empower physicians to rapidly 

20 identify the presence of bacteremia, which would profoundly impact patient care. Previous 
empiric decision making could be abandoned in fevor of educated practice, allowing appropriate 
and expeditious decision-inaking regarding need for antibiotic tberapy and hospitalization. 

Experimental work usmg the conserved features of the 16S rRNA common to ahnost all 
bacterial species, is an area of active investigation. Hospital test sites have focused on "high 

25 yield" clinical settings where expeditious identification of the presence of systemic bacterial 
infection has. immediate high morbidity and mortality consequences. Notable clinical infections 
have included evaluation of febrile infants at risk for sepsis, detection of bacteremia in febrile 
neutropenic cancer patients, and examination of critically ill patients in the intensive care unit 
While several of these studies have reported promismg results (with sensitivity and specificity 

30 weU over 90%), significant technical difficulties (described below) remain, and have prevented 
general acceptance of this assay in clinics and hospitals (wBch remam dqpdident on standard' 
blood culture methodologies). Even the revolutionary advances of real-time PGR technique, 
vMeh. offers a quantitative more reproducible and technically sunpler system, remains 
encumbered by mherent technical limitations of the PGR assay. 
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The principle shortcomings of applying PCR assays to the clinical setting include: 
inability to eliminate background DNA contammation; interference with the PCR amplification 
by substrates present in flie reaction; and limited capacity to provide rapid reliable speciation, 
antibiotic resistance and subtype identification. Some laboratories have recently made progress 

5 in identifying and removing inhibitors; however background contamination remains iMoblematic, 
and mefliods directed towards eliminating exogenous sources of DNA report significant 
duninution in assay sensitivity. Finally, vMe product identification and detailed characterization 
has been achieved using sequencing techniques, these approaches are laboriotis and time- 
intensive thus detracting from its clinical applicability. 

0 Rapid and definitive microbial identification is desirable for a variety of industrial, 

medical, environmental, quality, and research reasons. Traditionally, the microbiology laboratory 
has functioned to identify the etiologic agents of infectious diseases through direct examination 
and culture of specimens. Since the mid-1980s, researchers have repeatedly demonstrated the 
practical utility of molecular biology techniques, many of which form the basis of clinical 

5 diagnostic assays. Some of these techniques include nucleic acid hybridization analysis, 
restriction enzyme analysis, genetic sequence analysis, and separation and purification of nucleic 
acids (See, e.g., J. Sambrook, E. F. Fritsch, and T. Maniatis, Molecular Cloning: A Laboratory 
Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989). These 
procedures, in general, are time-consuming and tedious. Another option is the polymerase chain 

0 reaction (PCR) or other amplification procedure that amplifies a specific target DNA sequence 
based on the flanking primers used. Finally, detection and data analysis convert the hybridization 
event into an analytical result 

Other not yet fiilly realized applications of PGR for clinical medicine is the 
identification of infectious causes of disease previously desaibed as idiopadiic (e.g. Bartonella 

5 henselae in bacillary angiomatosis, and Tropheryma whgjpellii as the uncultured bacillus 
associated with Whipple's disease). Further, recent epidemiological studies which suggest a 
strong association between Chlamydia pneumonia and coronary artery disease, serve as example 
of the possible widespread, yet undiscovered links betwera pathogen and host vtiiich may 
ultimately allow for new insig^hts into pathogenesis and novel life sustaining or savmg 

} therapeutics. 

For flie'practicing clinician, PCR technology offers a yet unrealized potential for 
di^ostic omnipotence in the arena of infectious disease. A universal reliable infectious disease 
detection system would certainly become a fundamental tool in the evolving diagnostic 
armamentarium of the 21^ century clinician. For front line onergency physicians, or physicians 
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tvoddng in disaster settings, a quick universal detection ^stem, would allow for molecular triage 
and eaily aggressive targeted tiierapy. Preliminary clinical studies using species specific probes 
suggest that implementing rapid testing in acute care setting is feasible. Resources could tiius be 
ai^ropiiately applied, and patients with suspected infections could rapidly be risk stratified to 
S tiie diffoent treatment settings, depending on tiie pathogen and virulence. Furthermore, links 
witli data management systems, locally regionally and nationally, would allow for efTective 
epidemiological surveillance, witii obvious benefits for antibiotic selection and control of disease 
outbreaks. 

For the hospitalists, tiie ability to speciate and sub^e would allow for more precise 
1 0 decision-making regarding antimicrobial agents. Patients who are colonized with hi^y 
contagious pathogens could be appropriately isolated on mtry into die medical setting without 
delay. Targeted therapy will diminish development of antibiotic resistance. Furthermore, 
identification of the genetic basis of antibiotic resistant strains would permit precise 
phaimacologic intervention. Both physician and patient would benefit with less need for 
15 repetitive testing and elimination of wait times for test results. 

It is certain that the individual patient will benefit directly fixim this approach. Patients 
with unrecognized or difficult to diagnose infections would be identified and treated promptiy. 
There will be reduced need for prolonged inpatient stays, with resultant decreases in iatrogenic 
events. 

20 Mass spectrometry provides detailed information about the molecules being analyzed, 

including high mass accuracy. It is also a process that can be easily automated. Low-resolution 
MS may be unreliable when used to detect some known agents, if their spectral lines are 
sufficiently weak or sufiiciently close to &ose from other living organisms in the sample. DNA 
chips with specific probes can only determme the presence or absence of specifically anticipated 

25 organisms. Because there are hundreds of thousands of species of benign bacteria, some v«y 
similar in sequence to tiueat organisms, even arrays witii 10,000 probes lack the bteadtii needed 
to detect a particular organism. 

Antibodies &ce more severe diversity limitations tiian arrays. If antibodies are designed 
s^ainst highly conserved targets to increase diversity, tiie fidse alarm problem will dominate, 

30 again because threat organisms are very similar to benign ones. Antibodies are only capable of 
detecting known agents in relatively uncluttered environmente^^ " ' ■ 

Several groiq)5 have reported detection of PCR products using high resolution 
electrospray ionizatilon-Fourier transform-ion cyclotron resonance mass spectrometry CESI-FT- 
ICR MS). Accurate measurement of exact mass combined with knowledge of the number of at 
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least one nucleotide allowed calculation of the total base composition for PCR duplex products 
of approximately 100 base pairs. (Aasenid et al., J. Am. Soc. Mass Spec., 1996, 7, 1266-1269; 
Muddiman et al.. Anal. Chem., 1997, 69, 1543-1S49; Wunschd et al.. Anal. Chem., 1998, 70, 
1203-1207; Muddiman et aL, Rev. Anal. Chem., 1998, 17, 1-68). ElectrospKQr ionization-Fourier 
5 transform-ion cyclotron resistance ^SI-FT-ICR) MS may be used to determine the mass of 
double-stranded, SOO base-pair PCR products via tide average molecular mass (Hurst et al.. Rapid 
Commun. Mass Spec. 1996, 10, 377-382). Use of matrix-assisted laser desorption ionization- 
time of flight (MALDI-TOF) mass specbrometiy for characterization of PCR products has been 
described. (Muddiman et al.. Rapid Commun. Mass Spec., 1999, 13, 1201-1204). However, the 
10 degradation of DNAs over about 75 nucleotides observed \vilfa MALDI limited the utili^ of this 
method. 

U.S. Patent No. 5,849,492 reports a method for retrieval of phylogenetically informative 
DNA sequences which comprise searching for a highly divergent segment of genomic DNA 
surrounded by two highly conserved segments, designing the universal primers for PCR 
15 amplification of the highly divergent region, amplifying the genomic DNA by PCR technique 
using universal primers, and then sequencing the gene to determine the identity of the organism. 

U.S. Patent No, 5,965,363 reports mefliods for screening nucleic acids for 
polymorphisms by analyzing amplified target nucleic acids using mass spectrometric techniques 
and to procedures for improving mass resolution and mass accuracy of these methods. 
20 WO 99/14375 reports methods, PCR primers and kits for use m analyzing preselected 

DNA tandem nucleotide repeat alleles by mass spectrometry. 

WO 98/12355 reports methods of determining the mass of a target nucleic acid by mass 
spectrometric analysis, by cleaving the target nucleic acid to reduce its length, making the target 
single-stranded and using MS to determine tiie mass of tiie single-stranded shortened target Also 
25 reported are methods of preparing a double-stranded target nucleic acid for MS analysis 

comprising amplification of the target nucleic acid, binding one of Has strands to a solid support, 
releasing the second strand and then releasing the first strand which is then analyzed by MS. Kits 
for target nucleic acid preparation are also provided. 

PCT WO97/33000 reports methods for detecting mutations in a target nucleic acid by 
30 nonrandomly fiagmenting &e target into a set of smgle-stranded nonrandom lengdi fragments 
and deteiminingr&eir masses by "MS. - " - 

U.S. Patent No. 5,605,798 reports a fast and higihly accurate mass spectrometer-based 
process for detecting the presence of a particular nucleic acid in a biological sample for 
diagnostic purposes. 
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WO 98/21066 reports processes for determining the sequence of a particular target 
nucleic acid by mass spectrometry. Processes for detecting a target nucleic acid present in a 
biological sample by PCR amplification and mass spectrometry detection are imported, as are 
methods for detecting a target nucleic add in a sample by amplifying the target with primers that 
S contain restriction sites and tags, extending and cleaving the amplified nucleic acid, and 
detecting tibie presence of extended product, wherein the presence of a DNA fiagment of a mass 
different from wild-type is indicative of a mutation. Mefliods of sequencing a nucleic acid via 
mass spectrometry methods are also reported. 

WO 97/37041, WO 99/31278 and U.S. Patent No. 5,547,835 report methods of 
10 sequencmg nucleic acids using mass spectrometry. U.S. Patent Nos. 5.622,824, 5.872,003 and 
5,691,141 report methods, systems and kits for exonudease-mediated mass spectrometric 
sequencing. 

Hius, there is a need for a method for bioagent detection and identification which is 
both specific and rapid, and in which no nucleic acid sequencing is required. The present 
15 invention addresses this need. 

SUMMARY OF THE INVENTION 

The present invention is directed towards methods of identifying a pathogen in a 
biological sample by obtaining nucleic acid from a biological sample, selecting at least one pair 

20 of intelligent primers with the capability of amplification of nucleic acid of the pathogen, 
amplifying the nucleic acid with the primers to obtain at least one amplificatioii product, 
determining the molecular mass of at least one amplification product from which the pathogen is 
identified. Further, this invention is directed to methods of epidemic surveillance. By identifying 
a pathogen from samples acquired from a plurality of geographic locations, the spread of tiie 

25 pathogen to a given geographic location can be determined. 

The present invention is also directed to methods of diagnosis of a plurality of etiologic 
agents of disease in an individual by obtaining a biologicd sample fiom an individual, isolating 
nucleic acid from the biologicd sample, selecting a plurdity of amplification primers with the 
capabi% of amplification of nucleic acid of a plurality of etiologic agents of disease, amplifyring 

30 the nucleic acid witii a plurdity of primers to obtain a plurdity of amplification products 
Corresponding to a plurdity of etiologic agcmts, detenaimnff the mo 
plurality of unique amplification products which identify die members of the plurdity of 
etiologic agents. 
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The present invention is also directed to methods of /n silico screening of primer sets to 
be used in identification of a plurality of Inoagents by preparing a base composition probability 
cloud plot fix>m a plurality of base composition signatures of tiie plurality of bioagents generated 
in silico, inspecting the base composition probability cloud plot for overlap of clouds fiom 
5 different bioagents, and choosing primer sets based on minimal overlap offhe clouds. 

The present invention is also directed to methods of predicting the identity of a bioagent 
with a heretofore unknown base composition signature by preparing a base composition 
probability cloud plot fiom a plurality of base composition signatures of the plurality of 
bioagents which mcludes the heretofore unknown base composition, inspecting the base 

10 composition probability cloud for overlap of the heretofore unknown base composition with the 
cloud of a known bioagent such that overly predicts that the identity of the bioagent with a 
heretofore unknown base composition signature equals the identity of the known bioagent 

The present invention is also directed to methods for determining a subspecies 
characteristic for a given pathogen in a biological sample by identifying the pathogen in a 

15 biological sample using broad range survey primers or division-wide primers, selectic^ at least 
one pair of drill-down primers to amplify nucleic acid segments which provide a subspecies 
characteristic about the pathogen, amplifying the nucleic acid segments to produce at least one 
drill-down amplification product and determining the base composition signature of the drill- 
down amplification product wherein the base composition signature provides a subspecies 

20 characteristic about the pathogen. 

The present invention is also directed to methods of pharmacogenetic analysis by 
obtauiing a sample of genomic DNA from an individual, selecting a segment of the genomic 
DNA vMch provides i^armacog^ietic information, using at least one pau of mtelligent primers 
to produce an amplification product "^ch comprises the segmoit of genomic DNA and 

25 determining the base composition signature oftiie amplification product, wherein the base 
composition signature provides pharmacogenetic information about said individual. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures lA-lH and Figure 2 are consensus diagrams that show examples of conserved 
30 regions from 16S rRNA (Fig. lA-1, lA-2, lA-3, 1 A-4, and lA-5), 23S rRNA (3 '-half. Fig. IB, 
IC, and ID; 5'^half, Fig. lE-F), 23S rRNA Doniain l"(Fig: IG), 238 rRNA Domain IV (Fig. IH) 
and 16S rRNA Domam III (Fig. 2) vs^ch are suitable for use in the present invention. Lines vwth 
arrows are examples of regions to which intelligent primer pairs for PGR are designed. The label 
for each primer pair represents the starting and ending base number of the amplified region on 
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the consensus diagram. Bases in capital letters are greater dian 95% conserved; bases in lower 
case letters are 90-95% conserved, filled circles are 80-90% conserved; and open circles are less 
than 80% conserved. The label for each primer pair represents the starting and ending base 
number of the amplified region on the consensus diagram. The nucleotide sequence of the 16S 
5 rRNA consensus sequence is SEQ ID N0:3 and the nucleotide sequence of the 23S rRNA 
consensus sequence is SEQ ID NO:4. 

Figure 2 shows a typical primer amplified region fix>m ihe 16S rRNA Domain IE shown 
in Figure lA-1. 

Figure 3 is a schematic diagram showing conserved regions in RNase P. Bases in capital 
10 letters are greater than 90% conserved; bases in lower case letters are 80-90% conserved; filled 
circles designate bases which are 70-80% conserved; and open circles designate bases that are 
less than 70% conserved. 

Figure 4 is a schematic diagram of base composition signature determination using 
nucleotide analog 'tags" to determine base composition signatures. 
1 5 Figure 5 shows the deconvoluted mass spectra of a Bacillus anthracis region vath and 

without the mass tag phosphorothioate A (A*). The two spectra differ in that the measured 
molecular weight of the mass tag-containing sequence is greater than the immodified sequence. 
Figure 6 shows base composition signature (BCS) spectra from PGR products from 
, Staphylococcus aureus (S. aureus 16S_1337F) and Bacillus anthracis (B. anthr. 16S_1337F), 
20 amplified using the same primers. The two strands differ by only two (AT~>CG) substitutions 
I and are clearly distinguished on the basis of their BCS. 

Figure 7 shows that a single difference between two sequences (A14 m B. anthracis vs. 
A15 in A cereus) can be easily detected using ESI-TOF mass spectrometry. 

Figure 8 is an ESI-TOF of Badllus anthracis spore coat protein sspE 56mer plus 
25 calibrant The signals unambiguously identify B. anthracis versus other Bacillus species. 

Figure 9 is an ESI-TOF of a B. anthracis synthetic 16S_1228 duplex (reverse and 
forward strands). The technique easily distinguishes between the forward and reverse strands. 
Figure 10 is an ESI-FTICR-MS of a synthetic B. anthracis 16S_1337 46 base pair 

duplex. 

30 Figure 1 1 is an ESI-TOF-MS of a 56mer oUgonucleotide (3 scans) fcam. the B. anthracis 

saspB gene widi an internal mass sl^idard. The intetnai inass siianidaixls are designated by 
asterisks. 
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Figiue 12 is an ESI-TOF-MS of an internal standard \ntfa 5 mM TBA-TFA buffer 
showing that charge stiipi»ng viilb tiibutylanuDonium trifiuoroacetate reduces the most abundant 
charge state from [M-8H+]8- to [M-3H+]3-. 

Figure 13 is a portion of a secondary structure defining database according to one 
S embodhnent of ftie present mvention, where two examples of selected sequences are displayed 
graphically thereunder. 

Figure 14 is a three dimensional graph demonstrating ttie grouping of sample molecular 
weight according to species. 

Figure 15 is a three dimensional graph demonstrating the grouping of sample molecular 
10 weights according to species of vhus and mammal infected 

Figure 16 is a three dimensional graph demonstrating the grouping of sample molecular 
weights according to species of virus, and animal-origin of infectious agent 

Figure 17 is a figure depicting how a typical triangulation method of the present 
invention provides for die identification of an unknown bioagent without prior knowledge of the 
15 unknown agent. The use of different primer sets to distinguish and identify the unknown is also 
depicted as primer sets I, II and III within this figure. A three-dimensional gr^h depicts all of 
bioagent space (170), mcluding the unknown bioagent, which after use of primer set I (171) 
according to a method according to the present invention further differentiates and classifies 
bioagents according to major classifications (176) which, upon further analysis using primer set 
20 n (172) differentiates the unknown agent (177) fi'om other, known agents (173) and finally, tihe 
use of a third primer set (175) furOier specifies subgroups within the &mily of the unknown 
(174). 

Figure 18 shows a reixesoitative base composition probability cloud for a region of the 
RNA polymerase B gene from a cluster of enteiobacteria. The dadc spheres represent the actual 

25 base composition of die organisms. The lighter spheres represent the transitions among base 
compositions observed in diffemxt isolates of the same species of organism. 

Figure 19 shows resolution of enterobacteriae members with primers targeting RNA 
polymerase B (rpoB). A single psur of primers targeting a hyper-variable region within ipoB was 
sufBcient to resolve most members of this grotq) at the genus level Salmonella from Escherichia 

30 from Yersinia) as well as flie species/strain level (E. coli K12 fix>m 0157). All organisms vwth 
the exception of Y. pestis were tested in the lab and the measured biase counts (shown with 
arrow) matched the predictions in every case. 
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Figure 20 shows detection of S. aureus in blood. Spectra on the right indicate signals 
corresponding to S. aureus detection in spiked wells A J and A4 with no detection in control 
wells A2 and A3. 

Figure 21 shows a representative base composition distribution of humaa adenovirus 
5 strain types for a single primer pair region on Ihe hexon gene. The cireles r^resent (tifferent 
adenovirus seqwnces in our database that were used for primer design. Measurement of masses 
and base counts for each of the unknown samples A, B, C and D matdied one or more of flie 
known groups of adenoviruses. 

Figure 22 shows a representative broad range survey/drill-down process as applied to 
10 emm-typing of streptococcus pyogenes (Group A Str^tococcus: GAS). Genetic material is 
extracted (201) and amplified usmg broad range survey primers (202). The amplification 
products are analyzed (203) to detemune die presence and idaitity of bioagents at the species 
level. ]i Streptococcus pyogenes is detected (204), the enam-typing "drill-down" primers are 
used to reexamine the extract to identify tte emm-type of the sample ^05). Different sets of 
15 drill down primers can be employed to determine a subspecies characteristic for various strains 
of various bioagents (206). 

Figure 23 shows a representative base composition distribution of bioagents detected in 
throat swabs firom military personnel using a broad range primer pair directed to 16S rRNA. 

Figure 24 shows a representative deconvoluted ESI-FTICR spectra of the PCR products 
20 produced by the gti primer for samples 12 (top) and 10 (bottom) correspondmg to emm types S 
and 6, respectively. Accurate mass measurements were obtained by using an internal mass 
standard and post-calibrating each spectrum; the ejqjerimental mass measurement uncertainty on 
each strand is + 0.035 Daltons (1 ppm). Unambiguous base compositions of the amplicons were 
determined by calculating all putative base compositions of each stand within the measured mass 
25 (and measured mass uncertamty) and selecting complementary pairs widiin the mass 
measuremoit uncertainty. In dl cases there was only one base composition within 25 ppm. The 
measured mass diffisience of 15.985 Da between the steands shown on die left is in excellent 
agreement with the theoretical mass difference of 15.994 Da expected for an A to G substitution. 
Figure 25 shows representative results of the base composition analysis on throat swab 
30 samples using the six primer pairs, 5'-emm gene sequencing and the MLST gene sequencing 
"method of tiie present invention for an outbreak of Streptococcus pyogenes (group A 
streptococcus-, GAS) at a military training camp. 

Figure 26 shows: a) a representative ESI-FTICR mass spectrum of a restiiction digest of 
a 986 bp region of the 16S ribosomal gene from E. coli K12 digested wifli a mixture of BstNJ, 
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BmFI, Bfol and Ncol, b) a deconvolved lepiesentation (neutral mass) of the above spectnun 
showing the base compositions derived fiom accinate mass measurements of each fragment; and 
c) a representative reconstnicted restriction map showing complete base composition coverage 
for nucleotides 1-8S6. Use Ncol did not cut . 
S Figure 27 shows a r^resentative base composition distribution of /TOxvnrMjej for a 

single primer pair region on the DNA-dependent polymerase B gene (DdDpB). The spheres 
represent different poxvirus sequences that were used for prim^ design. 

DESCRIPTION OF EMBODIMENTS 

10 The present invention provides, inter alia, methods for detection and identification of 

bioagents in an unbiased maimer using "bioagent identifying amplicons." "Intelligent primers" 
are selected to hybridize to conserved sequence regions of nucleic acids derived fi-om a bioagent 
and vMch bracket variable sequence regions to yield a bioagent identifying amplicon which can 
be amplified and which is amenable to molecular mass determination. The molecular mass then 

15 provides a means to uniquely identify the Ho^ent wdthout a requirement for prior knowledge of 
the possible identity of the bioagent The molectilar mass or corresponding "base composition 
signature" (BCS) of the amplification product is then matched against a database of molecular 
masses or base composition signatures. Furthermore, the method can be applied to rapid parallel 
"multiplex" analyses, the results of v^ch can be employed m a triangulation identification 

20 strategy. The present method provides K^id throughput and does not require nucleic acid 
sequencing of the amplified target sequence for bioagent detecticxi and identification. 

In tiie context of tiiis invention, a "bioagenf ' is any organism, cell, or virus, living or 
dead, or a nucleic add derived fix>m such an organism, cell or viius. ^camples of bioagents 
include, but are not limited, to cells (mcluding, but not limited to, human clinical samples, 

25 bacterial cells and otiier patiiogens) viruses, fimgi, and protists, paraates, and pathogenicity 
markers (mcluding, but not limited to, patiiogenicity islands, antibiotic resistance genes, 
virulence fitctors, toxin g^es and other bioregulating compounds). Samples may be alive or 
dead or in a vegetative state (for example, vegetative bacteria or spores) and may be 
encapsulated or bioengmeered. In the context of tiiis invention, a 'pathogen" is a bioagent that 

30 causes a disease or disorder. 

" Despfteenormbus-biologicd diversity.aU forms of life on ear^ 
common features in their genomes. Bacteria, for example have highly conserved sequences in a 
variety of locations on their genomes. Most notable is the universally conserved region of the 
ribosome, but there are also conserved elements m other non-coding RNAs, including RNAse P 
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and the signal recognition particle (SRP) among otiiers. Bacteria have a common set of 
absohitely required genes. About 250 genes are present in all bacterial species (Mushegian et al., 
Proc. Natl. Acad Sci. U.S.A., 1996. 93. 10268; and Fraser et al.. Science, 1995, 270, 397), 
including tiny genomes like Afycoplasma, Ureaplasma and Rickettsia. These genes encode 
5 proteins involved in translation, replication, recombination and repair, transcription, nucleotide 
metabolism, amino waA. metabolism, lipid metabolism, weigy generation, uptake, secretion and 
the like. Examples of these proteins are DNA polymerase m beta, elongation factor TU, heat 
shock protein gioEL, RNA polymerase beta, phosphoglycerate kinase, NADH ddiydrogenase, 
DNA ligase, DNA topoisomerase and elongation fector O. Qperons can also be targeted using 

10 the present method. One example of an operon is the bQ> operon fiom enteropathogenic E. coli. 
Multiple core chromosomal genes can be used to classify bacteria at a genus or genus species 
level to determine if an organism has threat potential. The me&ods can also be used to detect 
patiiogenicity markers (plasmid or chromosomal) and antibiotic resistance genes to confirm the 
threat potential of an organism and to direct countermeasures. 

1 5 Since genetic data provide the underlying basis for identification of bioag^ by the 

methods of the present invention, it is prudent to select segments of micleic adds wfaidi ideally 
provide enough variability to distinguish each individual bioagent and whose molecular mass is 
amenable to molecular mass determination. In one embodiment of the present invention, at least 
one polynucleotide segment is amplified to facilitate detection and analysis in the process of 

20 identiiying the bioagent. Thus, the nucleic acid segments that provide enough variability to 
distinguish each individual bioagent and whose molecular masses are amenable to molecular 
mass determination are herein described as "bioagent identifying amplicons." The term 
"amplicon" as used herein, refers to a segment of a polynucleotide which is amplified in an 
amplification reaction. In some embodiments of the present invention, bioagent identifying 

25 amplicons comprise fi-om about 45 to about 1 50 nucleobases (i.e. from about 45 to about 1 50 
linked nucleosides). One of ordinary skill in the art will appreciate that the invention embo^es 
compounds of 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 
67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 
93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107. 108, 109, 1 10, 1 1 1, 1 12, 113, 

30 114, 115, 116, 117, 118, 119, 120. 121, 122, 123. 124. 125, 126, 127, 128, 129, 130, 131, 132, 
133, 134, 135, 136, 137,. 138, 139, 140,:141; 142, 143, 144, 145, 146, 147, 148, 149, and 150 
nucleobases m lengdi. 

As used herein, "intelligent primers" are primers that are designed to bind to highly 
conserved sequence regions that flank an intervenmg variable region and yield amplification 
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products which ideally provide enough variability to distinguish each individual bioagent, and 
which are amenable to molecular mass analysis. By flie tenn "highly conserved," it is meant that 
the sequence regions radiibit between about 80-100%, or between about 90-100%, or between 
about 95-100% identity. The molecular mass of a given amplification product provides a means 
S of identifying the bioagent fiom vtdiich it was obtained, due to the variability of the variable 
region. Thus, design of mtelligent primers involves selection of a variable region widi 
appropriate variability to resolve fhe identity of a particular bioi^ent It is fbe combination of the 
portion of the bioagent nucldc acid molecule sequence to wiiich the intelligent primers hybridize 
and die intervening variable region Htst makes up the bioagent identifying amplicoa Alternately, 

10 it is the intetvenmg variable region by itself tiiat makes up the bioagent identifying amplicoa 
It is understood m ^ art that fba sequence of a primer need not be 1 00% 
complementary to that of its target nucleic acid to be specifically hybridizable. Moreover, a 
primer may hybridize over one or more segments such that intervening or adjacent segments are 
not involved in the hybridization event (e.g., a loop structure or hairpin structure). The primers of 

15 the present invention can comprise at least 70%, at least 75%, at least 80%, at least 85%, at least 
90%, at least 95%, or at least 99% sequence complementarity to the target region within the 
highly conserved re^on to which they are targeted. For example, an intelligent primer wherein 
18 of 20 nucleobases are complementary to a highly conserved region would represent 90 
percent complementarity to the highly conserved region. In this example, the remaining 

20 noncomplementary nucleobases may be clustered or interspersed with complementary 
nucleobases and need not be contiguous to each other or to complementary nucleobases. As 
such, a primer which is 18 nucleobases in length having 4 (four) noncomplementary nucleobases 
which are flanked by two regions of complete complementarity with the hi^y conserved region 
would have 77.8% overall complememarity with tiie highly conserved region and would thus Ml 

25 within the scope of the present inventioa Percent complementarity of a primer with a region of a 
target nucleic acid can be detennined routinely usmg BLAST programs (basic local alignment 
search tools) and PowerBLAST programs known in the art (Altschul et al., J. Mol. Biol., 1 990, 
215, 403-410; Zhang and Madden, Genome Res., 1997, 7, 649-656). 

Percent homology, sequence identity or complementarity, can be detennined by, for 

30 example, the Gap program (Wisconsm Sequence Analysis Package, Version 8 for Unix, Genetics 
Computet Group, IMv^rsity Research Park, Madison WI), usk^ de&iilt settings, vk^cb uses tiie 
algorithm of Smitii and Waterman (Adv. Appl. Math., 1981, 2, 482-489). In some embodiments, 
complementarity of intelligent primers, is between about 70% and about 80%, In oAer 
embodiments, homology, sequence identity or complementarity, is between about S0% and 
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about 90%. In yet other embodiments, homology, sequence identity or complementarity, is 
about 90%, about 92%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99% 
or about 100%. 

The intelligent primers of this invention comprise from about 12 to about 35 
5 nucleobases (i.e. fiom about 12 to about 3S linked nucleosides). One of orcUnaiy skill in the art 
will appreciate that the invention embodies compounds of 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 
22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34. or 35 nucleobases in length. 

One havmg skill in the art amied with the prefored bioagent identifying amplicons 
defined by fbs primers ilhistrated herein will be able, without undue eiqierimentation, to identify 

10 additional intelligent primers. 

In one embodiment, the bioagent identifying amplicon is a portion of a ribosomal RNA 
(rRNA) gene sequence. WUh the complete sequences of many of the smallest microbial genomes 
now available, it is possible to identify a set of genes that defines •'munmal life" and idwitify 
composition signatures tiiat uniquely identify each gene and organism. Genes that encode core 

15 life functions such as DNA replication, transcription, ribosome structure, translatioi^ and 

transport are distributed broadly in the bacterial genome and are siiitable regions for selection of 
bioagent identifying amplicons. Ribosomal RNA (rRNA) genes comprise regions that provide 
useful base composition signatures. Like many genes involved in core life functions, rRNA 
genes contain sequences that are extraordinarily conserved across bacterial domains interspersed 

20 with regions of high variability that are more specific to each species. The variable regions can 
be utilized to build a database of base composition signatures. The strategy mvolves creating a 
structure-based alignment of sequences of the small (16S) and the large (23S) subunits of the 
rRNA genes. For example, there are currently over 13,000 sequences m tiie ribosomal RNA 
database that has been created and maintained by Robin Gutell, University of Texas at Austin, 

25 and is publicly available on the Institute for Cellular and Molecular Biology web page on the 
world wide web of the Internet at, for example, *Yna.icmb.utexas.edu/." There is also a publicly 
available rRNA database created and maintained by ttie University of Antwerp, Belgium on the 
world wide web of Ae Internet at, for example, "iina.uia.ac.be." 

These databases have been analyzed to determine regions that are useful as bioagent 

30 identifymg amplicons. The characteristics of such regions include: a) between about 80 and 
1 00%, or greater dian about 95% identity among species of the particular bioagent of interest, of 
upstream and downstream nucleotide sequences which serve as sequence amplification primer 
sites; b) an intervening variable region which exhibits no greater tium about 5% identity among 
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Species; and c) a separation of between about 30 and 1000 nucleotides, or no more than about 
50-250 nucleotides, or no more than about 60-100 nucleotides, between the conserved regions. 

As a non-limiting example, for id»itification of Bacillus species, the conserved 
sequence regions of tiie chosen bioagent identifying amplicon must be highly conserved among 
5 all Bacillus species while the variable region of the bioagent identifying amplicon is sufficientiy 
variable such that the molecular masses of the amplification products of all species of Bacillus 
ate distinguishable. 

Bioagent identifying amplicons amenable to molecular mass determination are eith^ of 
a length, size or mass compatible with the particidar mode of molecular mass determination or 
10 compatible with a means of providing a predictable fiagmentation pattern in order to obtain 
predictable fragments of a length compatible with the particular mode of molecular mass 
determination. Such means of providing a predictable firagmentation pattern of an amplification 
product include, but are not limited to, cleavage with restriction enzymes or cleavage primers, 
for example. 

1 5 Identification of bioagents can be accomplished at difiFerent levels using intelligent 

primers suited to resolution of each individual level of identification. "Broad range survey" 
intelligent primers are designed with the objective of identifying a bioagent as a member of a 
particular division of bioagents. A 'Tjioagent division" is defined as group of bioagents above the 
species level and includes but is not limited to: orders, families, classes, clades, genera or other 

20 such groupings of bioagents above die species level. As a non-limiting example, members of the 
Bacilltis/Clostridia group or gamma-proteobacteria group may be identified as such by 
employing broad range survey intelligent primers such as primers that target 16S or 23S 
ribosomal KNA. 

Li some embodiments, broad range survey intelligent primers are capable of 
25 identification of bioagents at tiie species level. One main advantage of the detection methods of 
the present invention is that the broad range survey intelligent primers need not be specific for a 
particular bacterial species, or even genus, such as Bacillus or Streptomyces. Instead, the primers 
recognize highly conserved regions across himdreds of bacterial species including, but not 
limited to, tiie species described h^in. Thus, the same broad range survey intelligent primer 
30 pair can be used to identify any desned bacterium because it wUl bmd to tiie conserved regions 
^t flank a variable region specific to a siiigle species. Or cominbn to several bacterial species, 
allowing unbiased nucleic acid amplification of the intervening sequence and determination of its 
molecular weight and base composition. For example, the 16S_971-1062, 16S_1228-1310 and 
16S_1 100-1188 regions are 98-99% conserved in about 900 species of bacteria (16S=16S rRNA, 
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numbers indicate nucleotide position). In one embodiment of the present invention, piimeis used 
in the present method bind to one or more of these regions or portions tbsreot 

Due to their overaU conservation, the flanking rRNA primer sequences serve as good 
intelligent primer binding sites to amplify the nucleic acid region of interest for most, if not all, 
5 bacterial species. The intervening region between &e sets of primers varies in length and/or 
composition, and thus provides a unique base composition signature. Examples of intelligent 
primers that amplify regions of the 16S and 23S rRNA are shown in Figures 1 A-IH. A typical 
prima: amplified region in 1 6S rRNA is shown in Figure 2. The arrows represent primers that 
bind to highly conserved regions that flank a variable region in 16S rRNA domain ID. The 

10 amplified region is the stem-loop structure under "^l 100-1188." It is advantageous to design the 
broad range survey intelligent primers to minimize the number of primMs required for the 
analysis, and to allow detection of multiple membos of a bioagent division using a sin^e pair of 
primers. The advantage of using broad range survey intelligent primers is that once a bioagent is 
broadly identified, the process of fiirther identification at species and sub-species levels is 

IS &cilitated by directing ttie choice of additional intelligent primers. 

"Division- wide" intelligent primers are designed with an objective of identifying a 
bioagent at the species level. As a non-limiting example, a Bacillus anthracis, Bacillus cereus 
and Bacillus thuringiensis can be distinguished firom each other using division-wide intelligent 
primers. Division-wide intelligent pruners are not always reqmred for identification at the 

20 species level because broad range survey intelligent primers may provide sufRdent identification 
resolution to accomplishing this identification objective. 

"Drill-down" intelligent primers are designed with an objective of identifying a sub- 
species characteristic of a bioagent. A "sub-species characteristic" is defined as a property 
imparted to a bioagent at tiie sub-species level of identification as a result of tiie presence or 

25 absence of a particular segment of nucleic acid. Such sub-species characteristics include, but are 
not limited to, strains, sub-types, pathogenicity markers such as antibiotic resistance genes, 
pathogenicity islands, toxin genes and virulence fectors. Identification of such sub-species 
characteristics is often aitical for determining proper clinical treatmait of pathogen infections. 
ChemicaLModificattons of Intelligent Primers 

30 Ideally, intelligent primer hybridization sites are highly conserved in order to fecilitate 

the hybridization of the primer. In'cases vrfiere primer hybridization is less'effident due to lower 
levels of conservation of sequence, mtelligent primers can be chemically modified to hnpiove 
the efficiency of hybridization. 
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For example, because any variation (due to codon wobble in tiie 3"* position) in Aese 
conserved regions among species is likely to occur in the third position of a DNA triplet, 
oligonucleotide primers can be designed such that the nucleotide corresponding to this position is 
a base vAdch can bind to more than one nucleotide, referred to herein as a •'universal base." For 
5 example, under this "wobble" pauing, inosme (I) bmds to U, C or A; guanine (G) binds to U or 
C, and uridme (U) binds to U or C. Oflier ocamples of universal bases include nitroindoles such 
as 5-ni1ioindole or 3-mtropyirole ^akes et al.. Nucleosides and Nucleotides, 199S, 14, 1001- 
1003), the degenerate nucleotides dP or dK (EliU et ctl.\ an acyclic nucleoside analog containing 
5-nitroindazole (Van AeischotetaL, Nucleosides and Nucleotides, 1995, 14, 1053-1056) or the 

10 purine analog l*<2-deoxy-p-D-ribofuranosyl)-iniidazole-4-carboxamide (Sala et al., NucL Acids 
Res., 1996, 24, 3302-3306). 

In another embodiment of flie invention, to compensate for the somev^ weaker 
binding by the "wobble" base, tilie oligonucleotide primers are designed such Aat the first and 
second positions of each triplet are occupied by nucleotide analogs viuch bind with greater 

15 affinity than the unmodified nucleotide. Examples of tiiese analogs include, but are not limited 
to, 2,6-diamiDopurine which binds to thymine, piopyne T which binds to adenine and propyne C 
and phenoxazines, including G-clamp, which binds to G. Propynylated pyrimidines are described 
in U.S. Patent Nos. 5,645,985, 5,830,653 and 5,484,908, each of which is commonly owned and 
incorporated herein by reference, in its entirety. Propynylated prhners are claimed m U.S Serial 

20 No. 10/294,203 which is also commonly owned and incorporated herein by reference in entirety. 
Phenoxazines are described in U.S. Patent Nos. 5,502,177, 5,763,588, and 6,005,096, each of 
which is incorporated herein by reference in its entirety. G-clamps are described in U.S. Patent 
Nos. 6,007,992 and 6,028,183, each of 'w^ich is incorporated herem by reference in its entirety. 
A theoretically ideal bioagent detector would identify, quantify, and report the complete 

25 nucleic acid sequence of every bioagrat ftat reached the sensor. The complete sequence of the 
nucleic acid component of a pafliogen would provide all relevant information about die threat, 
includmg its identity and the presence of drug-resistance or pathogenicity markers. This ideal has 
not yet been achieved. However, the present invention provides a straightforward strategy for 
obtaining mformation with the same practical vahie based on analysis of bioe^ent identifying 

30 amplicons by mdecular mass determination. 

In some cases, a moleciilar mass of a given bioagent identifying amplicon alone does 
not provide enou^ resolution to unambiguously identify a given bioagent For example, the 
molecular mass of the bioa^t identifying amplicon obtained using the intelligent primer pair 
"16S_97r would be 55622 Da for bothZ coli and Salmonella typhlmuritm. However, if 
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additional intelligent primers are employed to analyze additional bioagent identifying amplicons, 
a "triangulation identification" process is enabled. For example, the "16S_1 100" intelligent 
primer pair yields molecular masses of 55009 and 55005 Da for E. coli and Salmonella 
typhinmrium, respectively. Furthermore, the "ISS.SSS" intelligent primer pair yields molecular 

5 masses of 42656 and 42698 Da for E. coli and Salmonella typhimurhan, respectively. In this 
basic example, die second and tbird intelligent primer pairs provided the additional 
"fingerprinting" capability or resolution to distinguish between the two bioagents. 

In another onbodim^ triangulation identification process is pursued by measuring 
signals fiom a plurality of bioagent identifying amplicons selected within multiple core genes. 

10 This process is used to reduce fidse negative and &lse positive signals, and enable reconstruction 
of the origin of hybrid or otherwise engineered bioagents. In this process, after identification of 
multiple core genes, alignments are created fiom nucleic add sequence databases. The 
alignments are then analyzed for regions of conservation and variation, and bioagent identifying 
amplicons are selected to distinguish bioagrats based on specific genomic differences. For 

15 example, identification of the three part toxin genes typical of B. anthracis (Bowen et al., J. 
Ah>1. Microbiol., 1999, 87, 270-278) in the absence of the expected signatures firan the B. 
anthracis genome would suggest a genetic engineering event. 

The triangulation. identification process can be pursued by characterization of bioagent 
identifying amplicons in a massively parallel fashion using the polymerase chain reaction (PGR), 

20 siich as multiplex PGR, and mass spectiometric (MS) methods. Sufficient quantities of nucleic 
acids should be present for detection of bioagents by MS. A wide variety of techniques for 
preparing large amounts of purified nucleic acids or firagments thereof are well known to those of 
skill in the art. PGR requires one or more pairs of oligonucleotide primers that bind to regions 
which flank the target sequence(s) to be amplified. These primers prime synthesis of a different 

25 strand of DNA with synthesis occurring in the direction of one primer towards the other primer. 
The primers, DNA to be eunplified, a thermostable DNA polymerase (e.g. Taq polymerase), the 
four deoxynucleotide triphosphates, and a buffer are combined to initiate DNA synthesis. The 
solution is denatured by heating, then cooled to allow annealing of newly added primer, followed 
by another round of DNA syntiiesis. This process is typically repeated for about 30 cycles, 

30 resulting in amplification of the target sequence. 

Altixough the use of PGR is suitable, other nucleic acid mplification teclmiques may 
also be used, including ligase chain reaction C^CR) and strand displacement amplification 
(SDA). The hig^-resolution MS technique allows separation of bioagent spectral lines fiom 
background spectral lines m highly chittered environments. 
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In anolher embodiment, tbe detection scheme for the PGR products generated fiom the 
bioagent(s) mcoipoiates at least duee features. First, the technique simultaneously detects and 
differentiates multiple (generally about 6-10) PGR products. Second, the technique provides a 
molecular mass diat uniquely identifies tiie bioag^ fiom die possible primer sites. Finally, die 

5 detection technique is tqiid, allowing multiple PGR reactions to be xun in parallel. 

Mass spectrometry CMS)-based detection of PGR products provides a means for 
determination of BGS that has several advantages. MS is intrinsically a parallel detection scheme 
without the need fi>r radioactive or flu(»escent labels, since evoy amplification product is 
identified by its molecular mass. The current state of die art in mass spectrometry is such diat 

10 less than femtomole quantities of material can be readily analyzed to afford mformation about 
the molecular contents of the sample. An accurate assessment of the molecular mass of the 
material can be quickly obtained, irrespective of whether the molecular weight of the sample is 
several hundred, or in excess of one hundred thousand atomic mass units (amu) or Daltons. 
Intact molecular ions can be generated from amplification products usmg one of a variety of 

15 ionization techniques to convert the sample to gas phase. These ionization methods include, but 
are not limited to, electrospray ionization (ES), matrix-assisted laser desoiption ionization 
(MALDI) and fast atom bombardment (FAB). For example, MALDI of nucleic acids, along with 
examples of matrices for use in MALDI of nucleic acids, are described in WO 98/54751 
(Genetrace, Inc.). 

20 In some embodiments, large DNAs and RNAs, or large amplification products 

tiiere&om, can be digested with restriction endonucleases prior to ionizatioa Thus, for example, 
an amplification product that was 1 0 kDa could be digested with a series of resbiction 
endonucleases to produce apanel of, for example, 100 Da fiagments. Restiiction endoimcleases 
and their sites of action are well known to the skilled artisaiu Tn ttiis manner!, mass spectrometry 

25 can be performed for the purposes of restriction mapping. 

Upon ionization, several peaks are observed finm one sample due to the formation of 
ions with different charges. Averaging the multiple readings of molecular mass obtained finm a 
single mass spectrum affords an estimate of molecular mass of the bioagent Electrospray 
ionization mass spectrometiy (ESI-MS) is particularly usefiil for very high molecular wei^t 

30 polymers such as proteins and nucleic acids having molecular weights greater tiian 10 kDa, since 
it yields a distribution of multiply-charged molecules of the sample wifliout causing a significant 
amount of fragmentation. 
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The mass detectors used in the meAods of the jHesent invention include, but are not 
limited to, Fourier transform ion cyclotron resonance mass spectrometry (FT-ICR-MS), ion trap, 
quadrupole, magnetic sector, time of flight (TOP), Q-TOF, and triple quadrupole. 

In general, the mass spectrometric techniques -which can be used in the present 
5 invention include, but are not limited to, tandem mass spectrometry, infiared multiphoton 
dissociation and pyrolytic gas chiomatogiaphy mass spectrometry (PGC-MS). In one 
embodiment of the invention, the bioagent detection system operates continually in bioagent 
detection mode using pyrolytic GC-MS without PGR for rapid detection of mcreases in Womass 
(for example, increases in fecal contamination of drinking water or of germ warfere agents). To 
10 aclueve minimal latency, a continuous sample stream flows directly into the PGC-MS 

combustion chamber. When an increase in biomass is detected, a PGR process is automatically 
hutiated. Bioagent presence produces elevated levels of large molecular fiagments from, for 
example, about 100-7,000 Da which are observed in the PGC-MS spectrum. The observed mass 
spectrum is compared to a threshold level and when levels of biranass are detemnned to exceed a 
15 predetennined tiireshold, the bioagent classification process described heremabove (combining 
PGR and MS, such as FT-ICR MS) is initiated. Optionally, alarms or otiier processes (halting 
ventilation flow, physical isolation) are also initiated by fliis detected biomass level. 

The accurate measurement of molecular mass for large DNAs is lunited by the 
adduction of cations from the PGR reaction to each strand, resolution of tiie isotopic peaks fix>m 
20 natural abundance and isotopes, and assignment of the charge state for any ion. The 
cations are removed by in-line dialysis using a flow-tiirough chip tiiat brings the solution 
containing tiie PGR products into contact wilh a solution containing ammonium acetate in tiie 
presence of an electric field gradient orthogonal to tiie flow. The latter two problems are 
addressed by operating witii a resolving power of >100,000 and by incorporating isotopically 
25 depleted nucleotide triphosphates into tiie DNA. The resolving power of the instrument is also a 
consideration. At a resolving power of 10,000, tiie modeled signal fiom tiie [M-14H+]''*" charge 
state of an 84mer PGR product is poorly characterized and assignment of tiie charge state or 
exact mass is impossible. At a resolving power of 33,000, tiie peaks from tiie individual isotopic 
components are visible. At a resolvmg power of 100.000, tiie isotopic peaks are resolved to tiie 
30 baseline and assi^ent of tiie charge state for the ion is straightforward. The ["c,*^-depleted 
triphospliates are obtained, for example, by growuig microorganisms on dq)leted media and 
harvesting tiie nucleotides (Batey et aL, Nucl. Acids Res., 1992, 20, 4515-4523). 

While mass measurements of intact nucleic acid regions are beUeved to be adequate to 
determme most bioagents, tandem mass spectrometry 0*^") techniques may provide more 
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definitive infonnation pertaining to molecular identity or sequence. Tandem MS involves the 
coupled use of two or more stages of mass analysis vibsK both the separation and detection steps 
are based on mass spectrometry. The first stage is used to select an ion or componoit of a sample 
fiom which fivther structural information is to be obtained. The selected ion is then fiagmented 

5 using, e.g., blackbody uradiation, mfiared multipboton dissociation, or collisional activation. For 
example, ions generated by electrospray ionization (ESQ can be fiagmented usmg IR 
multipboton dissociation. This activation leads to dissociation of glycosidic bonds and the 
phosphate backbone, producing two series of fragment ions, called the vc-series (having an intact 
3' terminus and a 5' phosphate following internal cleavage) and the a>Base series (having an 

10 intact 5' terminus and a 3' furan). 

The second stage of mass analysis is then used to detect and measure the mass of these 
resulting fragments of product ions. Such ion selection followed by fragmentation routmes can 
be performed mtdtiple times so as to essentially completely dissect the molecular sequence of a 
sample. 

15 If there are two or more targets of similar molecular mass, or if a single amplification 

reaction results in a product that has the same mass as two or more bioagent reference standards, 
they can be distinguished by using mass-modifying "tags." In this embodiment of tibe invention, 
a nucleotide analog or "tag" is mcoiporated during amplification (e.g., a 5-(trifluoromethyl) 
deoxythymidine triphosphate) which has a different molecular weight than the unmodified base 

20 so as to improve distinction of masses. Such tags are described in, for example, PCT 

WO97/33000, which is incorporated herem by reference in its entirety. This further limits die 
numb^ of possible base compositions consistent with airy mass. For example, S- 
(trifluoromediyl)deoxytiiymidine triphosphate can be used in place of dTTP m a separate nucleic 
acid amplification reaction. Measurement of tfie mass shift between a conventional amplification 

25 product and the tagged product is used to quantitate die number of tiiymidine nucleotides in each 
of die single strands. Because the strands are complementary, the number of adenosuie 
nucleotides in each strand is also detrammed. 

hi anotiier amplification reaction, die number of O and C residues m each strand is 
detennmed usmg, for example, the cgrtiduie analog 5-mefliylcytosine (5-meC) or propyne C. The 

30 combmation of tiie A/T reaction and G/C reaction, followed by molecular weight determination, 
provides a unique b^e composition, t^s mediod is simimarized in Fig^ 
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The mass tag phosphorothioate A (A*) was used to distinguish a Bacillus anthracis 
cluster. Hie B. anthracis (AJ4G9C14T9) had an average MW of 14072.26, and the B. anthracis 
5 (AiA*i3G9CuT9) had an average molecular vireight of 14281.1 1 and tiie phosphorothioate A had 
an average molecular weight of +16.06 as determined by ESI-TOF MS. The deconvoluted 
spectra are shown in Figure 5. 

In another example, assume the measured molecular masses of each strand are 
30,000.115Da and 31,000.1 15 Da respectively, and the measured number of dl and dA residues • 
10 are (30,28) and (28,30). If the molecular mass is accurate to 100 ppm, there are 7 possible 
combinations of dG+dC possible for each strand. However, if the measured molecular mass is 
accurate to 10 ppm, there are only 2 combmations of dG+dC, and at 1 ppm accuracy there is 
only one possible base composition for each strand. 

Signals fix>m the mass spectrometer may be input to a maximum-likelihood detection 
1 5 and classification algorithm such as is widely used in radar signal processing. The detection 
processing uses matdied filtering of BCS observed in mass-basecount space and allows fi)r 
detection and subtraction of signatures fiwm known, harmless organisms, and for detection of 
unknovm bioagent threats. Comparison of newly observed bioagents to known bioagents is also 
possible, for estimation of threat level, by compaiiiig iflieir BCS to those of laibwn organisms and 
20 to known forms of pathogenicity enhancement, such as insertion of antibiotic resistance genes or 
toxin goiBs. 
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Processing may end with a Bayesian classifier using log likelihood ratios developed 
fix>m the observed signals and average background levels. The program onphasizes performance 
{uredictions culminating in probability-of-detection versus probability-of-&lse-alann plots for 
conditions invohdng complex backgrounds of naturally occurring organisms and environmental 
5 contaminants. Matched filt^ consist of a priori expectations of signal values given flie set of 
primers used for each of the bioagents. A genomic sequence database (e.g. GenBank) is used to 
define the mass basecount matched filters. The database contains known threat agents and benign 
background organisms. The latter is used to estimate and siibtract the signature produced by the 
background organisms. A maximum likelihood detection of known background organisms is 
10 unplemented usii^ matched filters and a running-sum estunate of the noise covariance. 
Background signal strengths are estimated and used along wiA the matched filters to fomi 
signatures that are then subtracted. The maximum likelihood process is applied to this "cleaned 
up" data in a similar manner employing matched filters for the organisms and a running-sum 
estimate of the noise-covaiiance for the cleaned up data. 
1 5 Although the molecular mass of amplification products obtained using intelligent 

primers provides a means for identification of bioagents, conversion of molecular mass data to a 
base composition signature is usefiil for certain analyses. As used herein, a "base composition 
signature" (BCS) is the exact base composition determined from the molecular mass of a 
bioagent identifying amplicon. In one embodiment, a BCS provides an index of a specific gene 
20 in a specific organism. 

Base compositions, like sequences, vary slightiy fi»m isolate to isolate within species. It 
is possible to manage this diversity by building "base composition probability clouds" around the 
composition constraints for each species. This pennits identification of organisms in a fashion 
similar to sequence analysis. A "pseudo four-dimensional plof can be used to visualize the 
25 concept of base composition probability clouds (Figure 1 8). Optimal primer design requkes 
optimal dioice of bioagent idoitifying amplicons and maximizes the separation between the base 
composition signatures of uidividual bioagents. Areas where clouds overly indicate regions tiiat 
may result in a ixusclassification, a problem which is overcome by selecting primers that provide 
information fiom different bioagent identifymg amplicons, ideally maximLdng the separation of 
30 base compositions. Thus, one aspect of the utility of an analysis of base composition pnibability 
clouds is tiiat it provides a means for screening priiner se6 in ordei: to avoid potential 
misclassificadons of BCS and bioagent idoitity. Another aspect of the utiUty of base 
composition probability clouds is that they provide a means for predicting the identity of a 
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bioagent whose exact measured BCS was not previously observed and/or indexed in a BCS 
database due to evolutionary transitions in its nucleic add sequence. 

It is important to note tliat, in contrast to probe-based techniques, mass spectrometry 
determination of base composition does not require prior knowledge of the composition in order 
5 to make the measurement, only to interpret Ifae results. In this regard, the present invention 
provides bioagent classifying information similar to DMA sequencing and phylogenetic analysis 
at a level sufScient to detect and identify a given bioagoit Furthermore, the process of 
determination of a previously unknown BCS for a given bioagent (for example, in a case where 
sequence information is unavailable) has downstream vShty by providing additbnal bioagent 
10 indexing information with which to populate BCS databases. The process of future bios^gent 
identification is tiius greatly unproved as more BCS indexes become available in the BCS 
databases. 

Another embodiment of the present invention is a me^od of surveying bioagent 
samples that oiables detection and idoitification of all bacteria for which sequence information 

15 is available using a set of twehre broad-range intelligent PGR primers. Six of the twelve primers 
are "broad range survey primers" herdn defined as primers targeted to broad divisions of 
bacteria (for example, the Bactlhis/Clostridia group or gamma-proteobacteria). The other six 
primers of the group of twelve primers are "division-wide" primers herein defined as primers 
that provide more focused coverage and higher resolution. This method enables identification of 

20 nearly 100% of known bacteria at the species level. A further example of this embodiment of the 
present invention is a method herein designated "survey/drill-down" wherein a subspecies 
characteristic for detected bioagents is obtained using additional primers. Examples of such a 
subspecies characteristic include but are not limited to: antibiotic resistance, pathogenicity 
island, virulence factor, strain type, sub-species type, and clade group. Using the survey/drill- 

25 down method, bioagent detection, confirmation and a subspecies characteristic can be provided 
within hours. Moreover, the survey/drill-down method can be focused to identify bioraigineering 
events such as the insertion of a toxin gene mto a bacterial species tihat does not normally make 
the toxin. 

The present methods allow extremely rapid and accurate detection and identification of 
30 bioagents compared to existing methods. Furthennore, this rapid detection and identification is 
possible even v/bea sample ma^fial is' unpiiie. The methodsleverage origomg biomedical - - 
research in vurulence, pathogenicity, drug resistance and genome sequencing into a mediod 
which provides greatly improved sensitivity, specificity and reliability compared to existing 
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mediods, with lower rates of false positives. Tbm, the mettiods are useiiil ia a wide variety of 
fields, including, but not limited to, fboss fields discussed below. 

In other embodiments of the invention, the methods disclosed herein can identify 
infectious agents in biological samples. At least a first biological sample conteiinlng at least a 
5 first unidentified infisctious agent is obtained. An identification analysis is canied out on the 
sample, v/hsKby die first infectious agoit in the first biological sample is identified. More 
particularly, a method of identifying an infectious agent in a biological eatity is provided. An 
identification analysis is carried out on a first biological sample obtained fiom the biological 
entity, vdiereby at least one infectious agent in die biological sample from the biological entity is 

10 identified. The obtaining and the performing steps are, optionally, repeated on at least one 
additional biological sample firom the biological entity. 

The present invention also provides methods of identifying an infectious agent that is 
potentially the cause of a health condition in a biological entity. An identification analysis is 
carried out on a first test sample from a first infectious agent differentiating area of the biological 

1 5 entity, whereby at least one infectious agent is idmtified. The obtaining and the performing steps 
are, optionally, repeated on an additional infectious agent differentiating area of die biological 
entity. 

Biological samples include, but are not limited to, hair, mucosa, skin, nail, blood, saliva, 
rectal, lung, stool, urine, breath, nasal, ocular sample, or the like, hi some embodiments, one or 

20 more biological samples are analyzed by the methods described herein. The biological sample(s) 
contain at least a first unidentified infectious agent and may contain more than one infectious 
agent. The biological sBmple(s) are obtained fiom a biological entity. The biological sample can 
be obtained by a variety of manners such as by biopsy, swabbing and the like. The biological 
samples may be obtfuned by a physician in a hospital or other healtii care environment The 

25 physician may then perform the identification analysis or send fi]e biological sample to a 
laboratory to cany out the analysis. 

Biological entities mclude, but are not limited to, a mammal, a bird, or a reptile. The 
biological entity may be a cow, horse, dog, cat, or a primate. The tnological entity can also be a 
hiunan. The biological entity may be living or dead. 

30 An infectious agent differentiating area is any area or location withm a biological entity 

that can distinguish between a harinfld versus normil health condition. An infectious agent 
differentiating area can be a region or area of the biolo^cal entity whereby an infectious agent is 
more likely to predominate from another region or area of the biological entity. For example, 
infectious agent differentiating areas may include die blood vessels of the heart (heart disease, 
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coronary artery disease, etc.). particular portions of the digestive system (ulcers, Crohn's disease, 
etc.), liver (hepatitis infections), and the like. In some embodiments, one or more biological 
samples from a plurality of infectious agrait differentiating areas is analyzed the methods 
desoibed herein. 

5 Infectious agents of the invention may potentially cause a health condition in a 

biological entity. Health conditions include any condition, syndrome, illness, disease, or tiie like, 
identified currentiy or in tiie future by medical personnel. Infectious agents include, but are not 
limited to, bacteria, viruses, parasites, fungi, and the like. 

In other embodiments of Has invention, the methods disclosed herein can be used to 

1 0 screen blood and other bodily fluids and tissues for pathogenic and non-pathogenic bacteria, 
viruses, parasites, fimgi and the like. Annual samples, including but not limited to, blood and 
other bodily fluid and tissue samples, can be obtained &om living animals, -who are eitiier known 
or not known to or suspected of having a disease, infection, or conditioa Alternately, animal 
samples such as blood and other bodily fluid and tissue samples can be obtained fix>m deceased 

1 5 animals. Blood samples can be fiirther separated into plasma or cellular fractions and furtiier 
soeened as desired. Bodily fluids and tissues can be obtained from any part of the animal or 
human body. Animal samples can be obtamed from, for example, mammals and humans. 

Clinical samples are analyzed for disease causing bioagents and biowar&re pathogens 
simultaneously with detection of bioagents at levels as low as 100-1000 genomic copies in 

20 complex backgrounds with throughput of jqpproximately 100-300 samples with simultaneous 
detection of bacteria and viruses. Such analyses provide additional value in probing bioagent 
genomes for unanticipated modificeitions. These analyses are carried out in reference labs, 
hospitals and the LRN laboratories of the public health system in a coordinated fashion, with the 
ability to report the results via a computer network to a common data-monitoring center in real 

25 time. Clonal propagation of specific infectious agents, as occurs in tiie epidemic outbreak of 
infectious disease, can be tracked witili base composition signatures, analogous to the pulse field 
gel electrophoresis fingerprinting patterns used in tracking the spread of specific food pathogens 
in the Pulse Net system of the CDC (Swaminathan et al.. Emerging Infectious Diseases, 2001, 7, 
382-389). The present invention provides a digital barcode in the form of a series of base 

30 composition signatures, the combination of which is unique for each known or^nism. This 
capabilify enables real-time infectious disease monitoriiig across broad gebgr^hic locations, 
which may be essential in a simultaneous outbreak or attack in different cities. 

In other embodiments of the invention, the metiiods disclosed herein can be used for 
detecting the presence of pathogenic and non-patiiogenic bacteria, viruses, parasites, fimgi and 
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the like in organ donors and/or in organs from donors. Such examination can result in the 
prevoation of the transfer of, for example, viruses sudi as West Nile virus, hepatitis viruses, 
human unmunodeficiency vims, and the like fix)m a donor to a recipient via a transplanted organ. 
The methods disclosed herein can also be used for detection of host versus graft or graft versus 
5 host rejection issues related to organ donors by detecting the presence of particular antigens in 
either Qie graft or host known or suspected of causing such rejection. In particular, the bioagrats 
in tills regard are the antigens of the major lustocompatibility complex, such as the HLA 
antigens. The present metiiods can also be used to detect and track emerging infectious diseases, 
such as West Nile virus infection, HIV-related diseases. 

1 0 In other embodiments of the mvention, tiie methods disclosed herein can be used for 

phannacogenetic analysis and medical diagnosis including, but not limited to, cancer diagnosis 
based on mutations and polymorphisms, drug resistance and susceptibility testing, screening for 
and/or diagnosis of genetic diseases and conditions, and diagnosis of infectious diseases and 
conditions. In context of the present invention, phamiacogenetics is defined as the study of 

1 5 variability in drug response due to genetic factors. Phaimacogenetic investigations are often 
based on correlating patient outcome with variations in genes involved in the mode of action of a 
given drug. For example, receptor genes, or genes involved in metabolic pathways. The methods 
of the present invention provide a means to analyze the DNA of a patient to provide the basis for 
phannacogenetic analysis. 

20 The present method can also be used to detect single nucleotide polymorphisms (SNPs), 

or multiple nucleotide polymorphisms, rapidly and accurately. A SNP is defined as a single base 
pair site in the genome that is different from one mdividual to another. The differoice can be 
expressed eitiier as a deletion, an msertion or a substitution, and is frequently liiiked to a disease 
state. Because they occur every 100-1000 base pairs, SNPs are the most fiequentiy bound type of 

25 genetic marker in the human genome. 

For ocample, sickle cell anemia results finm an A-T transition, vAach encodes a valine 
rather than a glutamic acid residue. Oligonucleotide primers may be designed such that tiiey bind 
to sequences tiiat flank a SNP site, followed by nucleotide amplification and mass determination 
of the amplified product Because the molecular masses of the resulting prodiict finm an 

30 mdividual who does not have sickle cell anemia is different from tiiat of tiie product tmm an 
individiial wiio has 'the disease, the method can be used to distinguish the two individuals. Thus, 
tiie metiiod can be used to detect any known SNP in an individual and tiius diagnose or 
determine increased susceptibility to a disease or conditioa 
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In one embodiment, blood is drawn from an individual and peripheral blood 
mononuclear cells (PBMC) are isolated and simultaneously tested, such as in a high-throughput 
screening metiliod, for oie or more SNPs usmg ^propriate primers based on i3ae known 
sequences v^iich flank the SNP region. The National Center for Biotechnology hifbrmation 
5 mamtains a publicly available database of SNPs on the world vide web of the htemet at, for 
ocample, "ncbijihnjiih.gov^M>/." 

Ihte method of the present invention can also be used for blood typing. The gene 
encoding A, B or O blood type can differ by four single nucleotide polymorphisms. If the gene 
contains the sequence COTOOTGACCCTT (SEQ ID NO:S), antigen A results. If the gene 

10 contains the sequence CXjTCGTCACCGCTA (SEQ ID N0:6) antigen B results. If the gene 
contains the sequence CXjTGGT-ACCCCTT (SEQ ID N0:7), blood group O lesuHs ("-" 
indicates a deletion). These sequences can be distinguished by designing a single primer pair 
which flanks tiiese regions, followed by amplification and mass detemunation. 

Tlie me&od of the present invention can also be used for detection and idoitification of 

1 5 blood-bome padiogens such as Steq)hyiococcus aureus for example. 

The method of the present invention can also be used for strain typing of respiratory pathogens 
in epidemic surveillance. Group A streptococci (GAS), or Streptococcus pyogenes, is one of the 
most consequential causes of respiratory infections because of prevalence and ability to cause 
disease with complications such as acute rheumatic fever and acute glomerulonephritis . GAS 

20 also causes infections of the skin (impetigo) and, in rare cases, invasive disease such as 

necrotizing fasciitis and toxic shock syndrome. Despite many decades of study, the underlying 
microbial ecology and natural selection that favors enhanced virulence and explosive GAS 
outbreaks is still poorly understood. The ability to detect GAS and multiple other pathogenic 
and non-pathogenic bacteria and viruses in patient samples would greatly facilitate our 

25 understanding of GAS epidemics. It is also essential to be able to follow the spread of virulent 
strains of GAS in populations and to distinguish virulent strains from less virulent or avirulent 
streptococci that colonize the nose and throat of asymptomatic individuals at a frequency ranging 
torn S-20% of the population ^isno, A. L. (1995) m Principles and Practice of Infectious 
Diseases, eds. Mandell. G, L., Bennett, J. E. & Dolin, R. (Churchill Livmgston, New York), Vol. 

30 2, pp. 1786-1799). Molecular metiiods have been developed to type GAS based upon the 
sequence of the emm gene tibat en(k>des the M-pibtein vinile^ 

Micro., 1996, 34. 953-958; Beall etaL. J. Clin. Micro., 1997, 35, 1231-1235; and Facklam et al.. 
Emerging Infectious Diseases, 1999, 5, 247-253). Using tiiis molecular classification, over 150 
different enun-types are defined and correlated with phenotypic properties of thousands of GAS 
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isolates (www.cdc.gov/ncidod/biotech/ strep/strepindexiiiml) (Facklam et al., Clinical Infectious 
Diseases, 2002, 34, 28-38). Recently, a strategy known as Multi Locus Sequence Typing 
(MLST) was developed to follow the molecular Epidemiology of GAS. In MLST, internal 
fiagments of seven housekeeping genes are amplified, sequenced, and compared to a database of 
5 previously studied isolates (www.testmlstnet/). 

The present invration enables an emm-typing process to be carried out directly ftom 
tiiroat swabs for a large number of samples within 12 hours, allowing strain tracking of an 
ongomg epidemic, even if geographically dispersed, on a larger scale than ever before 
achievable. 

10 In anotfao- embodiment, the present invention can be en^iloyed in the serotyping of 

viruses including but not limited to, adenoviruses. A(tenovinises are DNA viruses that cause 
over 50% of febrile respiratory illnesses in military recruits. Human adenoviruses are divided 
into six major serogroups (A through F), each containing multiple strain types. Despite the 
prevalence of adenoviruses, fliere are no rapid methods for detecting and serotyping 

15 adenoviruses. 

In another embodiment, the present invention can be employed in distingxiishing 
between members of the Orthopoxvirus genus. Smallpox is caused by the Variola virus. Other 
members of the genus include Vaccinia, Monkeypox, Camelpox, and Cowpox. All are capable of 
infecting humans, thus, a method cq)able of identifying and distinguishing among members of 

20 the Orthopox genus is a worthwiiile objective. 

. In another embodiment, the present invention can be employed in distinguishing 
between viral agents of viral hemonhagjc fevers (VHF). VHF agents include, but are not limited 
to, Filoviridae (Marburg virus and Ebola vmis),Arenaviridae (Lassa, Junin, Macfaupo, Sabia, 
and Guanarito viruses), Bunyaviridae (Crimean-Congo hemorrhagic fever virus (CCHFV), Rift 

25 Valley fever virus, and Hanta viruses), and Flavmridae O^eUow fever virus and dengue virus). 
Infections by VHF viruses are associated with a wide spectrum of clinical manifestations such as 
diarrhea, myalgia, coug^ headache, pneumonia, enceidialopathy, and hepatitis. Filoviruses, 
arenaviruses, and CCHFV are of particular relevance because they can be transmitted from 
human to human, thus causing epidemics with high mortality rates (Khan et al.. Am. J. Trop. 

30 Med. Hyg., 1997, 57, 519-525). In tiie absence of bleeding or organ manifestation, VHF is 
clmically difBcidt to diagnose, and- the various etiologic ageiits can haidly Kb distinguished by 
clinical tests. Current approaches to PCR detection of tiiese agents are time-consuming, as they 
include a sqwrate cDNA synthesis step prior to PCR, agarose gel analysis of PCR products, and 
in some instances a second round of nested amplification or Southern hybridization. FCRs for 
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different pathogens have to be run assay by assay due to differences in cycling conditions, which 
complicate broad-range testing in a short period. Moieovw, post-FCR processing or nested PGR 
steps included in currently used assays increase the risk of false positive results due to carryover 
contamination (Kwok et al.. Nature, 1989, 339, 237-238). 
S In another embodiment, the present invention, can be employed in the diagnosis of a 

plurality of etiologic agents of a disease. An "etiologic agent" is herein defined as a pathogen 
acting as the causative agent of a disease. Diseases may be caused by a plurality of etiologic 
agents. For example, recent studies have implicated both human herpesvirus 6 (HHV-6) and the 
obligate intiacellular bacterium CMamydia pneumoniae in the etiology of multiple sclerosis 

10 (Swanborg, Microbes and Infection, 2002, 4, 1327-1333). The present invention can be applied 
to the identification of multiple etiologic agents of a disease by, for example, the use of broad 
range bacterial intelligent primers and division-iwide primers Of necessary) for the identification 
of bacteria such as Chlamydia pnetanoniae followed by primers directed to vual housekeeping 
genes for tiie identification of viruses such as HHV-6, for example. 

IS In other embodimoits of ttie invention, the methods disclosed herein can be used for 

detection and identification of pathogens in livestock. Livestock includes, but is not limited to, 
cows, pigs, sheep, chickens, turkeys, goats, horses and other farm animals. For example, 
conditions classified by the California Department of Food and Agriculture as emergency 
conditions in livestock (www.cdfa.ca.gov/ahfss/ah/pdfs/CA_reportable_disease_list_ 

20 05292002.pdf) include, but are not limited to: Anthrax (Bacillus anthracis), Screwworm myiasis 
{Cochliomyia hominivorax or Chiysomya bezziam), Afiican trypanosomiasis (Tsetse fly 
diseases). Bovine babesiosis (piroplasmosis). Bovine spongiform encephalopathy (Mad Cow), 
Contagious bovine pleuropneumonia (Mycoplasma mycoides mycoides small colony), Foot-and- 
mouth disease (Hoof-and-mouth), Heartwater (Cowdria ruminantium). Hemorrhagic septicemia 

25 (Pasteurella multdcida serotypes B:2 or E:2), Lumpy skin disease, Malignant catarrhal fever 
(African type), Rift Valley fever. Rinderpest (Cattle plague), Theileriosis (Corridor disease. East 
Coast fever). Vesicular stomatitis. Contagious agalactia (Mycoplasma species). Contagious 
c^rine pleuropneumonia (Mycoplasma capricolvm capripneumoniae), Nairobi sheep disease, 
Peste des petits ruminants (Ooat plague), Pubnonary ad^omatosis (Viral neoplastic pneumonia), 

30 Salmonella abortus ovls. Sheep and goat pox, Afiican swine fever, Classical swine fever (Hog 
cholera), Japanese eiM^phaUtis, MpaU viriisi SwiiiB veacifla^ 

(Enterovirus encephalomyelitis). Vesicular exanthema. Exotic Newcastle disease (Viscerotropic 
velogenic Newcastie disease), Ifighly patfaogaiic avian influenza (Fowl plague), African horse 
sickness, Dourine (Trypanosoma equiperdum), EpiTootic lymphangitis (equine blastomycosis, 
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equine histoplasmosis). Equine piioplasmosis (Babesia equi. B. cabattf). Glanders (Farcy) 
(Pseudomonas mallei), Hendra virus (Equine morblllMrus), Horse pox. Surra (lYypanosoma 
evansi), Voiezuelan equine encephalomyelitis. West Nile Vims, Chronic wasting disease in 
cervids, and Vinl hemorrhagic disease of rabbits (calicivinis) 

5 Conditions clasafied by the CaMmua Department of Food and Agriculture as regulated 

conditions in livestock include, but are not limited to: rabies, Bovine brucellosis {Brucella 
abortus). Bovine tuberculosis (MycobacteWum bovis). Cattle scabies (multiple types), 
Trichomonosis (Tritrichomonas fetus). Caprine and ovine brucellosis (excluding Brucella ovis), 
Scrapie, Sheep scabies (Body mange) (Psoroptes ovls). Porcine brucellosis (Brucella suis), 

10 Pseudorabies (Aujeszky's disease). Ornithosis (Psittacosis or avian chlamycHosis) (Chlamydia 
psittaci), Pullorum disease (Fowl typhoid) (Salmonella gallinarum and pullorum). Contagious 
equine metritis (Taylorella equigenitalis). Equine encephalomyelitis (Eastern and Western 
equine encephalitis), Equine infectiotis anemia (Swamp fever). Duck viral enteritis (Duck 
plague), and Tuberculosis in cervids. 

1 5 Additional conditions monitored by the California Department of Food and Agriculture 

include, but are not limited to: Avian tuberculosis (Mycobacterium avium), 
Echinococcosis/Hydatidosis (Echinococcus species), Leptospirosis, An^lasmosis (Anaplasma 
marginak at A. centrale), Bluetongue, Bovine cysticercosis (Taenia saginata in humans). 
Bovine g^tal campylobacteriosis (Campylobacter fetus venerealis), Dermatophilosis 

20 (Streptothricosis, mycotic dermatitis) (Dermatophilus congolensis). Enzootic bovine leukosis 
(Bovine leukemia virus). Infectious bovine ihmotracheitis (Bovine heipesvinis-1), Johne's 
disease (Paxatuberculosis) (Mycobacterium avium paratuberculosis). Malignant cataniial fever 
(North American), Q Fever (Coxiella burnetii), Caprine (contagious) arthritis/encephalitis, 
Enzootic abortion of ewes (Ovine cMamydiosis) (Chlamydia psittaci), Maedi-Visna (Ovine 

25 progressive pneumonia). Atrophic rhinitis (Bordetella bronchiseptica, Pasteurella multocida), 
Porcme cysticercosis (Taenia solium m humans), Porcine reproductive and respiratory 
syndrome. Transmissible gastroenteritiis (coronavirus), Trichmellosis (Trichinella spiralis). 
Avian mfectious bronchitis, Avian in&ctious laryngotracheitis. Duck viral hepatitis, Fowl 
cholera (Pasteurella multocida). Fowl pox, hrfectious bursal disease (Gumboro disease). Low 

30 pathogenic avian influenza, Marek's disease, Mycoplasmosis (Mycoplasma gallisepticum), 
-Equine influenza Equine rhinopnexirnonitis ^iiine Kerpeswus-l), fiqmne viral arteritis, and 
Horse mange (multiple types). 

A key problem in determining that an infectious outbreak is the result of a bioterrorist 
attack is the sheer variety of organisms that might be used by terrorists. According to a recent 
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review (Taylor et al., PhUos. Trans. R. Soc. Lond. B. Biol. Sci., 2001, 356, 983-989), there ai« 
over 1400 or^nisms infectious to humans; most of these have the potential to be used m a 
deliberate, malicious attack. These numbers do not include numerous strain variants of each 
organism, bioengineered versions, or pa&ogens tb&t infect plants or animals. Paradoxically, most 
5 of the new technology bemg developed for detection of biological weapons incorporates a 
version of quantitative PGR, whidi is based upon tiie use of highly specific primers and probes 
designed to selectively identify specific pathogenic organisms. This approach requires 
assumptions about the type and strain of bacteria or vuus wfaidi is expected to be detected. 
Although tiiis approach will woric for the most obvious organisms, like smallpox and anthrax, 

10 experience has shown tiiat it is very difficult to anticipate what a terrorist will do. 

Hie present invention can be used to detect and identify any biological agent, including 
bactoia, viruses, fungi and toxms without prior knovidedge of the organism being detected and 
identified. As one example, -wbsK the agent is a biological tiireat, the information obtained such 

. as tite presence of toxin genes, patiiogenicity islands and antibiotic resistance genes for example, 

15 is used to determine practical infonnation needed fat countermeasures. hi addition, the metiiods 
can be used to identify natural or deliberate engineering events including chromosome fiagment 
swapping, molecular breeding (gene shuffling) and emer^g infectious diseases. The present 
invention provides broad-function technology that may be the only practical means for rapid 
diagnosis of disease caused by a biowarfare or bioterrorist attack, especially an attack that might 

20 otiierwise be missed or mistaken for a more common infection. 

Bacterial biological warfare agents capable of being detected by the present methods 
include, but are not limited to, Bacillus anthracis (anthrax). Yersinia pestis (pneumonic plague), 
Franciscella tularensis (tularemia). Brucella suis. Brucella abortus. Brucella melitemis 
(undulant fever), Burkholderia mallei (glanders), Burkholderia pseudomalleii (melioidosis), 

25 Salmonella typhi (typhoid fever), Rickettsia typhii (epidemic typhus), Rickettsia prawasehi 
(endemic typhus) and Coxiella burnetii (Q fever), Rhodobacter capsulatus, Chlamydia 
pneumoniae, Escherichia coli. Shigella dysenteriae. Shigella flexneri. Bacillus cereus, 
Clostridium botulinum, Coxiella humetti, Pseudomonas aerupnosa, Legionella pneumophila, 
and Vibrio cholerae. 

30 Besides 16S and 23S rRNA, other target regions suitable for use in the present mvention 

for detrotion of bacteria iaclude, but arehot liinited to, SSrRNAaik^ 

Fungal biowar&re agents include, but are not limited to, Coccidioides immitis 
(Coccidioidomycosis), and Magnaporthe grisea. 
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Biological 'wai&re toxin genes capable of being detected by the methods of fbe present 
invention inchide, but are not limited to, botulinum toxin, T-2 mycotoxins, licin, staph 
enteiotoxin B, shigatoxin, abrin, aflatoxin, Clostridium perfringens epsilon toxin, conotoxins, 
diacetoxyscirpenol, tetrodotoxin and saxitoxin. 
5 Parasites Haat could be used in biological warfiue include, but are not limited to: Ascaris 

suum. Giardia lamhlia, Cryptosporic&um, soA Schistosoma. 

Biological wufare viral threat agents are mostly RNA viruses (positive-strand and 
negative-strand), wi& the exception of smallpox. Every KNA virus is a fiunily of related viruses 
(quasispecies). These viruses mutate rapidly and the potential for engineered strains (natural or 

10 deliberate) is veiy high. RNA viruses cluster into &milies that have conserved RNA structural 
domains on the viral genome (e.g., virion components, accessory proteins) and conserved 
housekeeping genes that encode core viral proteins including, for single strand positive strand 
RNA viruses, RNA-dependent RNA polymerase, double stranded RNA helicase, chymotrypsin- 
like and papain-like proteases and methyltransferases. "Housekeeping genes" refers to genes that 

1 5 are generally always expressed and thought to be involved in routme cellular metabolism. 

Examples of (-)-strand RNA viruses include, but are not limited to, arenaviruses (e.g., 
sabia virus, lassa fever, Meichupo, Argentine hemorrhagic fever, flexal virus), bunyaviruses (e.g., 
hantavirus, nairovixus, phiebovirus, Hantaan virus, Congo-crimean hemorrhagic fever, rift valley 
fever), and mononegawales (e.g., filovirus, paramyxovirus, ebola vlras, Marburg, equine 

20 morbillivirus). 

Examples of (+)-strand RNA viruses include, but are not limited to, picomaviruses (e.g., 
coxsackievirus, echovirus, human coxsackievirus A, human echovirus, human enterovirus, 
human poliovirus, hepatitis A virus, human parechovurus, human riunovirus), astroviruses (e.g., 
human astrovirus), calciviruses (e.g., chiba virus, chitta vuns, human calcivmis, norwalk virus), 

2S nidovnales (e.g., hmnan coronavuus, human torovirus), flavhdruses (e.g., dengue virus 1-4, 
Japanese encephalitis virus, Kyanasur forest disease virus, Murray Valley encephalitis virus, 
Rocio vurus, St Louis encephalitis virus, West Nile virus, yellow fever virus, hepatitis c virus) 
and togaviruses (e.g., CUkogunya virus, Eastern equine encephalitis virus, Mayaro virus, 
O'nyong-nyong virus, Ross River virus, Venezuelan equine encephalitis virus. Rubella virus, 

30 hepatitis E virus). The hepatitis C virus has a S'-untranslated region of 340 nucleotides, an open 
reading frame encoding 9 proteins having 3010 ammo acids andTa 3'-uiitrahslated region of 240 
nucleotides. The 5'-UTR and 3*-UTR are 99% consaved in hepatitis C vfaruses. 

In one embodiment, the target gene is an RNA-dependent RNA polymerase or a 
helicase encoded by (+)-strand RNA viruses, or RNA polymerase from a (-)-strand RNA virus. 
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(+)-strand RNA viruses are double stranded RNA and replicate by RNA-directed RNA synthesis 
using RNA-dependent RNA polymerase and the positive strand as a template. Helicase unwinds 
the RNA duplex to allow replication of the single stranded RNA. These viruses include vkuses 
from the femily picomaviridae (e.g., poliovttus, coxsackievirus, echovmis), togaviridae (e.g., 
S alphavirus, flavivirus, rubivhus), arenaviridae (e.g., lymphocytic choriomeningitis virus, lassa 
fever virus), cononavuidae (e.g., human lespiratray virus) and Hepatitis A virus. The genes 
encoding these proteins comprise variable and highly conserved regions that flank the variable 
regions. 

In one embodiment, the method can be uised to detect Has presence of antibiotic 
10 resistance and/or toxin genes m a bacterial species. For example. Bacillus anthraas comprising a 
tetracycline resistance plasmid and plasmids encoding one or both antfarads toxms (pxOl and/or 
px02) can be detected by using antibiotic resistance primer sets and toxin gene primer sets. If the 
B. anthracis is positive for tetracyclme resistance, tiien a different antibiotic, for example 
quinalone, is used. 

1 S While tiie present invention has been described with specificity in accordance with 

cotain of its embodiments, the following examples serve only to illustrate the invention and are 
not intended to limit the same. 

EXAMPLES 

20 Example 1: Nucleic Acid Isolation and PCR 

In one embodiment, nucleic acid is isolated from the organisms and amplified by PCR 
using standard methods prior to BCS detennination by mass spectrometry. Nucleic acid is 
isolated, for example, by detergent lysis of bacterial cells, centrifugation and ethanol 
precipitation. Nucleic acid isolation methods are described in, for example. Current Protocols in 

25 Molecular Biology (Ausubel et al.) and Molecular Cloning: A Laboratory Manual (Sambrook et 
al). The nucleic acid is then amplified using standard methodology, such as PCR, with primers 
which bind to conserved regions of the nucleic acid which contain an intervening variable 
sequence as described below. 

General Genomic DNA Sample Prep Protocol: Raw samples are filtered using Supor- 

30 200 0.2 pi monbrane syringe filters (VWR International) . Samples are transferred to 1.3 ml 
eppenddrftijbes pre-fiUed witii 0.45 g of 0.7 mm Zircohia beads followed by the addition of 350 
jJ of ATL buflfer (Qiagen, Valencia, CA). The samples are subjected to bead beating for 10 
nunules at a fiBquency of 19 1/s m a Retsch Vibration Mil ^etsch). After centrifiigation. 
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samples aie transferred to an S-block plate (Qiagen) and DNA isolation is completed with a 
BioRobot 8000 nucleic acid isolation robot (Qiagen). 

Swab Sample Protocol: Allegiance S/P brand culture swabs and collection/transport 
system are used to collect samples. After dryinfe swabs are placed in 17x100 mm culture tubes 
5 (VWR International) and the genomic nucleic acid isolation is carried out automaticaUy with a 
Qiagen Mdx robot and the Qiagen QIAamp DNA Blood BioRobot Mdx genomic preparation kit 
(Qiagen, Valraicia, CA). 

Example 2: Mass spectrometry 

10 FTICR Instrumentation: The FTICR instrument is based on a 7 tesla actively shielded 

superconducting magnet and modified Bruker Daltonics Apex n 70e ion optics and vacuum 
chamber. The spectrometer is interfaced to a LEAP PAL autosampler and a custom fluidics 
control system for high throughput screening applications. Samples are analyzed directly ficom 
96-well or 384-well microtiter plates at a rate of about 1 sample/minute. The Bniker data- 

1 5 acquisition platform is supplemented with a lab-buiit ancillary NT datastation which controls the 
autosampler and contains an arbitrary waveform generator capable of generating complex rf- 
excite waveforms (frequency sweeps, filtered noise, stored wavefonn inverse Fourier transform 
(SWIFT), etc.) for sophisticated tandem MS experiments. For oligonucleotides in the 20-30-mer 
re^e typical performance characteristics include.mass resolving power in excess of 100,000 

20 ^fWHM), low ppm mass measuranent errors, and an operable m/z range between 50 and 5000 
m/z. 

Modified ESI Source: In sample-limited analyses, analyte solutions are delivered at 150 
nUminute to a 30 mm i.d. fused-silica ESI emitter mounted on a 3-D micromanipulator. The ESI 
ion optics conasts of a heated metal cq)illary, an rf-only hexapole, a skimmer cone, and an 

25 auxiliary gate electrode. The 62 cm rf-only hex^le is comprised of 1 mm diameter rods and is 
operated at a voltage of 380 Vpp at a frequency of 5 MHz. A lab-built electro-mechanical shutter 
can be employed to prevent the electrospray plume from entering the inlet capillary unless 
triggered to the "open" position via a TTL pulse from the data station. Whai m the "closed" 
position, a stable electrospray plume is maintained between the ESI emitter and the &ce of the 

30 Gutter. The back fiice of the shutter arm contains an elastomeric seal that can be positioned to 
form a vacuum seal with the inlet caiHllary. When the seal is rcimoved, a 1 mm gap between the 
shutter blade and the capillary inlet allows constant pressure in the external ion reservoir 
regardless of whether the shutter is in tiie opoi or closed position. When the shutter is triggered, 
a "time slice" of ions is allowed to enta- the inlet capillary and is subsequentiy accumulated in 
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tiie external ion reservoir. The rapid response time of the ion shutter (< 25 ms) provides 
reproducible, user defined intervals during which ions can be injected into and accumulated in 
tiie external ion reservoir. 

Apparatus for Ir^ared Mdtiphoton DissocUxtiom A 25 watt CW CO2 laser operating at 

5 10.6 pm has been inter&ced to the spectrometer to mable mfiared muMphoton dissociation 
(IRMPD) for oligonucleotide sequencmg and other tandem MS applications. An aluminum 
optical bench is positioned approximately 1.S m fiom the actively shielded superconductmg 
magnet such that the laser beam is aligned with the central axis of the magnet Usmg standard 
IR-compatible mirrors and kinematic minor mounts, the unfocused 3 mm. laser beam is aligned 

10 to traverse directly through the 3.5 mm holes m the trapping electrodes of the FTICR trapped ion 
cell and longitu^nally traverse the hexi4x>le region of the external ion guide finally unpm^g 
on the skumner cone. This scheme allows IRMPD to be conducted in an m/z selective maimer in 
the tnqjped ion cell (e.g. following a SWIFT isolation of the spedes of mterest). or m a 
broadband mode in the high pressure region of the external ion reservou: where collisions with 

15 neutral molecules stabilize IRMPD-generated metastable fragment ions resulting m mcreased 
firagment ion yield and sequence coverage. 

Example 3: Identificaition of Bioagents 

Table 2 shows a small cross section of a database of calculated molecular masses for. 

20 over 9 primer sets and approximately 30 organisms. The primer sets were derived jSrom rRNA 
alignment. Examples of regions from rRNA consensus alignments are shown in Figines lA-lC. 
Lines with arrows are examples of regions to which intelligent primer pairs for PGR are 
designed. The primer pairs are >95% conserved in the bacterial sequence database (currently 
over 10,000 organisms). The intervening regions are variable in length and/or composition, thus 

25 providing the base composition "signature" (BCS) for each organism. Primer pairs were chosen 
so the total length of the amplified region is less than about 80-90 nucleotides. The label for each 
primer pair represents the starting and ending base number of the amplified region on the 
consensus diagram. 

Included in fiie short bact^al database cross-section in Table 2 are many well known 
30 pafliogens/biowarfaie agents (shown in bold/red typefece) such as BactUus anthracis or Yersinia 
pestis as "well as some of the bacteiial organisms found commonly' in the natural environment 
such as Streptomyces. Ev«i closely related organisms can be distinguished fiom each other by 
the apiHXjpriate choice of primers. For instance, two low G+C arganisms. Bacillus anthracis and 
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Staph aureus, can be distinguished fiom each other by using the primer pair defined by 
16S_1337 or 23S_855 (AM of 4 Da). 



Table 2: Cross Section Of A Database Of Calculated Molecular Masses' 



Prinwr Red Ions — * 
Bua Name 


18S_971 


6S_1100 


6S_1337 


I6S_1294 


I6S_1228 


!3S_1021 


23S_S55 


235.193 


23S_115 


AcinetDbacter catooaoetlcus 

Bordelella bronchlseptica 
BoitefQ buigdoiferi 
Bnteena abortus 
Cami^toiaclerjejimi 
Chlamydia pnuemonlae 
Ciostiidliimbotultnun) 










61295.4 


3029B 






54999 

■99660,3., 


9«K,1 


54387.9 






51295.4 


30295 


42651 


39560.5 


















519?0,5 




.K621.1 


28440.7 


35952,9 


61295,4 




429?S,9 


38941,4 


52524.6 




SS011 


















54388.9 


J29061,8 


3S856.8 


.60674,9 


,?0294 






.4573?,5 


55000 


,.5?7fi7,., 




?S9S5 








38341 


54999 


Enteracoccosliaeealls 

Eseherlshlacoll 

Frandsenatmarensls 




543860 


J«444,7. 
284478 


35853.9 
35955,9 


.61298,4 
51288.4 


-30297.. 


41417,8 
42652 




66612.2 
.5.6,8.4a,3, , 


53TM 


jaafi5_ 






^1391 

61?99 


30391 , 








Haemophilus Muenzae 
Klebsfefla pneumoniae 
Legionella pneumophila 
Mycobacterium avium 
Mycobacterium leprae 


55620.1,, 




29444,7 

284427 




51298.4 
51297.4 


30298 


42656 
42655 


39560.S 
39562.5 


55613.1 


55618 




28446 


35957 


51303 














«s<»4,a 
.29064,9 


,.»8.519_ 
3S86g,9 


5191S.S 
S1B17.S 


30298 , 


426,56.,, 


3994?,4.. 
39?59,5 


66241.2 
59240.2 


Mycoplasma gsnitaliun 
Mycoplasffla pneumoniae 
Neissetia gonontweae 


5?1497 


«1is,4 


.29081,8 


35980.9 


51301.4 
5QS71 .3 


30269 
302B4 


4?66e 
43264.1 


395SS.S 


56249.2 , 

56842.4 




.45118,4 

Jfiaaaa... 


.29fl91.«... 

28443 




51302,4 


30294 
30298 


42849 
.43272 


39559,5 „ 

39561.5 

39558 


59943,4 
55619 


Rickettsia prowazekB 
Rlclcettsiarieketldl 
SabnotieOa typhimurium 
Shigella dysenterlae 
Staphylococcus aureus 
Streptomyces 
Treponema pallidum 
Vibrio choleras 
Vibilo parahaemolyticus 
Yefslnia oestls 


WW— 

5?m 


?5f21 


28448 


J«5? 


S0S77 

.59979 


.3P2» 


42650 
,42948 


39559 , 


53755.. 




.65005... 






.51391 


















513Q1 














,29.443,Z... 














59245.? 


S8341.e 
.5.5931,1 


-2.%B.5.7... 


35858.9 
35851,9, 


51300,4,. 
61297.4 


3029^.. 


42034.9.. 


„.39Sfia.s 

38939.4 


.56864 3 

.57479.4, 




■S5?29 ■ 






_5253S 






,.,36241 




g4384,9 


55629,1 


-M444.7 


34620 7 


-50P94,?... 


















51299 











5 'Molecular mass distribution of PGR amplified regions for a selection of organisms (rows) 
across various primer pairs (columns). Pathogens are shown in bold. Empty cells indicate 
presently incomplete or missing data. 

Figure 6 shows the use of ESI-FT-ICR MS for measurement of exact mass. The spectra 
from 46mer PGR products originating at position 1337 of the 16S rRNA from S. aureus (upper) 

10 and B. arahraels (lower) are shown. These data are from the region of the spectrum containing 
signals from the [M-Sm-]*" charge states of the respective 5'-3' strands. The two strands di£fer 
by two (AT->CG) substitutions, and have measured masses of 14206.396 and 14208.373 + 
0.010 Da, respectively. The possible base compositions derived fiom the masses of the forward 
and reverse strands for &e B. emAracis prodiicts are listed in Table 3. 

15 Tabk 3: Possible base composition for B. anthrads products 



CalcMass 


Eihror 


BaseCompV 


14208293S 


0.079520 


Al G17C10T18 


142083160 


0.056980 


A1G20C15T10 
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14208.3386 


0.034440 


A1G23C20T2 


14208.3074 


0.065560 


A6G11C3T26 


14208.3300 


0.043020 


A6 G14aT18 


14208.3525 


0.020480 


A6G17C13T10 


14208.3751 


0.002060 


A6G20C18T2 


14208.3439 


0.029060 


A11G8C1T26 


14208.3665 


0.006S20 


A11G11C6T18 


14208.3890 


QM6m 


All G14C11T10 


14208.4116 


0.038560 


All G17C16T2 


14208.4030 


0.029980 


A16 G8C4T18 


14208.4255 


0.052520 


A16G11C9T10 


14208.4481 


0.075060 


A16 G14 C14 T2 


14208.4395 


0.066480 


A21 G5C2T18 


14208.4620 


0.089020 


A21G8C7T10 


14079.2624 


0.080600 


A0G14C13T19 


14079.2849 


0.058060 


A0G17C18T11 


14079.3075 


0.035520 


AO G20 C23 T3 


14079.2538 


0.089180 


A5 G5 CI T35 


14079.2764 


0.066640 


A5G8C6T27 


14079.2989 


0.044100 


A5G11C11T19 


14079.3214 


0.021560 


A5G14 C16T11 


14079.3440 


O.O0O9SO 


A5G17C21T3 


14079.3129 


0.030140 


A10G5C4T27 


14079.3354 


0.007600 


A10G8C9T19 


14079J579 


0,014940 


AlO GU C14T11 


14079.3805 


0.037480 


A10G14C19T3 


14079.3494 


0.006360 


A15G2C2T27 


14079.3719 


0.028900 


A15G5 C7T19 


14079.3944 


0.051440 


A15G8 C12T11 


14079.4170 


0.073980 


A15G1-1C17 T3 .. .. 


14079.4084 


0.065400 


A20G2C5T19 


14079.4309 


0.087940 


A2OG5C10T13 
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Among the 16 compositions for the forward strand and the 18 compositions for the reverse 
strand that vr&cc calculated, only one pair (shown in bold) are complementaiy, corresponding to 
the actual base compositions of die B. anthracis PGR products. 

5 Exnapls4:BCS of R9^onfr<mBadUus anthracis mdBadllusce^^ 

A conserved Bacillus region from B. anthracis (AmQsCuT?) and B. cereus 
(A15G9C13T9) having a C to A base change was synthesized and subjected to ESI-TOF MS. The 
results are shown in Figure 7 in v^ch the two regions are clearly distinguished using die metiiod 
of the present invention (NflW=14072.26 vs. 14096.29). 

10 

Example 5: Identification of additional bioagents 

In other examples of die present invention, the pathogen Vibrio cholera can be 
distinguished ftom Vibrio parahemolyticus wife AM > 600 Da using one of three 1 6S primer sets 
shown in Table 2 (16S_971, 16S_1228 or 16S_1294) as shown in Table 4. TTie two mycoplasma 

15 species in the list (M genitalium and Af. pneumoniae) can also be distinguished from each other, 
as can the three mycobacteriae. While the direct mass measurements of amplified products can 
identify and distinguish a large number of organisms, measurement of the base composition 
signature provides dramatically enhanced resolving power for closely related organisms. In cases 
such as Bacillus anthracis and Bacillus cereus that are virtually indistinguishable from each 

20 other based solely on mass differences, compositional analysis or fragmentation patterns are used 
to resolve the differences. The single base difference between the two organisms yields different 
fragmentation patterns, and despite the presence of the ambiguous/unidentified base N at 
position 20 in B. anthracis, the two organisms can be identified. 

Tables 4a-b show examples of primer pairs fix>m Table 1 which distinguish pathogens 

25 from background. 



Table 4a 



Organism name 


23S_855 


16S_1337 


23S_1021 


Bacittus anthracis 


42650.98 


28447.65 


30294.98 


^aphylococcus aureus 


42654.97 


28443.67 


30297.96 
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Table 4b 



Organism name 


16S_971 


16S_1294 


16S_1228 


Vibrio cholerae 


55625.09 


35856.87 


52535.59 


Vibrio parahaemofyticus 


54384.91 


34620.67 


50064.19 



Table 5 shows the expect molecular wei^t and base composition of region 
16S_1 100-1 188 m Mycobacterium avium and Streptomyces sp. 

5 

Tables 



R^lon 


Organism name 


Length 


Molecular 
weight 


Base comp. 


16S_1100-1188 


Mycobacterium avium 


82 


25624.1728 


A16G32C18T16 


16S_1 100-1188 


Streptomyces sp. 


96 


29904.871 


A17G38C27T14 



Table 6 shows base composition (single strand) results for 16S_1 100-1 188 primer 
amplification reactions different species of bacteria. Species which are repeated in the table 
10 (e.g., Clostridium botulintm) are different strains which have different base compositions in the 
16S_1 100-1 188 region. 



Table 6 



Organism name 


Base comp. 


Organism name 


Base comp. 


Mycobacterium avium 


A16G32CJ8T16 


Vibrio cholerae 


A23G30C21T16 


Streptomyces sp. 


Al7G3gC27Ti4 


Aeromonas hydrophila 


AjsGsiCnTis 


Ureaplasma weafyticum 


A18C530C17T17 


Aeromonas sabnonicida 


A23G31C21T1S 


Streptomyces sp. 


A|9G36C24Tig 


Mycoplasma gerdtalium 


A24G19C12T18 


Mycobacterium leprae 


A2oG32C22Ti6 


Clostridium botulinum 


A24G25CigT20 


M. tuberculosis 


A20G3JC11TK 


Bordetella bronchiseptica 


A24G26Ci9Ti4 


Nocardia asteroides 


A20G33C21T16 


Francisella tularensis 


A24G26C19T19 


Fvsobacterium necroforum 


A21G26C22T18 


Bacillus anihracis 


A24G26C20T18 


UsterUx monocytogenes 


A21G27C19T19 


Campylobacter Mmti 


A24G26C2oTi8 


Clostridium botuUnum 


A2iG27Ci9T2i 


Staphylococcus aureus 


A24G26C20T18 


Neisseria gonorrhoeae 


A2lG28C2iTig 


Helicobacter pylori 


A24G26C20T19 


Bartonella quintana 


A21G30C22T16 


Helicobacter pylori 


A24G26C21T18 
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Enterococcus faecalis 


A22G27C20T19 


Moraxella catarrhalis 


A24G26C23T16 


BaclUtis megaterium 


A22G28C20T18 


Haemophilus influenzae Rd 


A24G28C20T17 


Bacillus suhtilis 


A22G28C21T17 


Chlan^dia trachoma^ 




Pseudotnonas aeruginosa 


A22G29C23T15 


Chhai^dophUa pneumoniae 


A24G28C2l'ri6 


Legionella jpnewnopkila 


A22G32C20T16 


C pneumonia AR39 


A24G2sC2lTi( 


Afycoplastna pneumoniae 


AjsGioCuTifi 


Pseudomonas putida 


A24G29C21T16 


Clostridium botuUnum 


A2}G26C2oTi 9 


Proteus vulgaris 


A24G30C21T1S 


Enterococcus faeciutn 


A2 JG2SC2 iTi 8 


Yersinia pestis 


A24G30C21T15 


Acinetobacter calcoaceti 


A23G26C21T19 


Yersinia pseudotuberculos 


AJ4G30C21 Ti s 


Leptospira borgpeterseni 


A23G26C24T15 


Clostridium botulinum 


A25G24C18T21 


Leptospira interrogans 


A23 G2^ C24T15 


Clostridium tetani 


A2SG25C18T20 


Clostridium psrfringens 


A23G27C 19T1 9 


Francisella tularensis 


A25G25C 1 9T1 9 


BadlUis anthracis 


A23G27C2ftTlS 


Acinetobacter calcoacetic 


A2JG26C20T19 


Bacillus cereus 


A23G27C!2oT)8 


Bacteriodesfragilis 


A2sQ27C KT22 




A23Cr27C2o1']8 


Chlamydophila psittaci 


A2jC^7C2lTi< 


Aeromonas hydrophila 




Borrelia burgdor^ri 


A25G29C17T19 


Escherichia coli 


A23G29C2JTI6 


Streptobacillus monilifor 


A26G26C20T16 


Pseudomonas putida 


A23G29C2lTi7 


Rickettsia prowazekii 


A26G28C18T18 


Escherichia coli 


A23G29C22T15 


Rickettsia rickettsii 


A26G2gC2oTi6 


Shigella dysenteriae 


A23G2,C22Ti5 


Mycoplasma mycoides 


A28G23C 14X20 



The same organism having differ^ base compositions are different strains. Groups of 
organisms which are highlighted or in italics have the same base compositions in the amplified 
region. Some of these organisms can be distingiiished using multiple primers. For example, 

5 Bacillus anthracis can be cQstinguished from Bacillus cereus and Bacillus thuringiensis using the 
primer 16S_971-1062 (Table 7). Other primer pairs which produce imique base composition 
signatures are shown in Table 6 (bold). Clusteis containing very similar threat and ubiquitous 
non-threat organisms (e.g. anthracis cluster) are distinguished at high resolution with focused 
sets of primer pairs. The known biowarfare agents in Table 6 are Bacillus anthracis. Yersinia 

10 pestis, FranciseUattdea-ensis mi Ricfos^ 
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Organism 


16S_971-1062 


16S_m8-1310 


16S_1100-1188 


Aeromonas kydrophila 


A21G29C22T20 


A22G27C21TI3 


A23G31C21TI5 


Aeromonas sabnonicida 


A21G29C22T20 


A22G27C21T13 


Aa3G3iC2iTis 


Bacillus anthracis 


A21G27C22T22 


A24G22CI9T18 


Aa3G27C2oTi8 


Bacillus cereus 


A22G27C21T22 


A24Cb2Ci9Tj8 


A23G27C20T18 


Bacillus thuringjensis 


A22Ga7C2lT22 


A24<^Ci9Ti8 


A23G27C20T18 


Chlamydia trachomatis 


A22G26C2oT2J 


A24G23CI9T16 


A24G28C2lTi6 


Chlamydia pneumoniae AR39 


A26Q23C20T22 


A26(^Ci6Ti8 


A24G28C21T16 


Leptospira horgpetersenii 


A22G26C20T21 


A22G25C2IT15 


A23G26C24T15 


Leptospira Interrogans 


A22G26C2QT21 


A22G25C21T15 


A23G26C24T15 


Mycoplasma genitalium 


A28G23C15T22 


AaoGisCisTw 


A24G19C12T18 


Mycoplasma pneumoniae 


A2»G23Ci5T22 


A276|9Ci«T20 


A23G20C14T16 


Escherichia coli 


A22G28C2»T22 


A24G25C21T13 


A23G29C22T15 


Shigella dysenteriae 


A22Ga8C2lT21 


A24G25C21T13 


A23G29C22T15 


Proteus vulgaris 


A23G26C22T21 


A26G24C19TI4 


A24G30C21T15 


Yersinia pestis 


A24G25C21T22 


A25G24C2oTi4 


A24G30C21T15 


Yersinia pseudotuberculosis 


A24G25C21T22 


A2SG24C20T14 


A24G30C2lT,5 


Francisella tularensis 


A20G2SC21T23 


A33G26C17TI7 


A24G26C19T19 


Rickettsia prawazeHi 


A2iG26C24T25 


A24G23C16T19 


A2«G28Ci8Ti8 


Rickettsia rickettsii 


A2lGifiC2sT24 


A24G24C17T17 


A26G28C2oTi« 



The sequence of B. anthracis and B. cereus in region 16S_971 is shown below. Shown 
in bold is the single base difference betwe^ the two species tibat can be detected using the 
5 methods of the present inventioiL B. anthracis has an ambiguous base at position 20. 
B.anthracis_l6S_91l 

GCGAAGAACCUUACCAGGUMJUGACAUCCUCUGACAACCCUAGAGAUAGOGCUUC 
UCCUUCGGGAGCAGAGUGACAGGUGGUGCAUGGUU (SEQ ID N0:1) 
B.cereus_l6S_97l 

10 GCOAAGAACCUUAeCAGGUCUUGACAUCCUCUOAAAAeeCUAGAOAUAGGGCUUC 
UCCUUCGGQAGCAGAGUGACAGGUGGUGCAUGGUU (SEQ ID N0:2) 
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Example 6: ESI-TOF MS-of sspE S6-mer Plus Calibrant 

The mass measurement accuracy that can be obtained using an internal mass standard in 
the ESI-MS study of PGR products is showi in Fig.8. The mass standard was a 20-mer 
phosphorofliioate oligonucleotide added to a solution containiag a S6-mer PGR product.fiom the 
5 B. anthracis spote coat protein sspE. The mass of the expected PGR product distinguishes B. 
anthracis fiom other species of Bacillus such as B. thuringiemis and B. cereus. 

Example 7: A mthraeis ESI-TOF Synthetic 16S_1228 Daplex 

An ESI-TOF MS spectrum was obtained from an aqueous solution containing 5 ^M 
10 each of synthetic analogs of the expected forward and reverse PGR products from the nucleotide 
1228 region of the B. anthracis 168 rRNA gene. The results (Fig. 9) show that the molecular 
weights of the forward and reverse strands can be accurately determined and easily distinguish 
the two strands. The [M-21H*]^'" and [M-20H*f charge states are shown. 

1 5 Example 8: ESI-FTICR-MS of Synthetic B. anthracis 16S_1337 46 Base Paw Duplex 

An ESI-FnCR-MS spectrum was obtained from an aqueous solution containing 5 \M 
each of synthetic analogs of the expected forward and reverse PGR products from the nucleotide 
1337 region of the B. anthracis 16S rRNA gene. The results (Fig. 10) show ttiat the molecular 
wei^ts of tiie strands can be distinguished by this method. The [M-ldH*!'*" through [M- 

20 lOH*]"^ charge states are showa The insert highlights the resolution that can be realized on flie 
FTIGR-MS mstrument, viuch allows the charge state of the ion to be detennmed from the mass 
difference between peaks differing by a single 13G substitution. 

Example 9: ESI-TOF MS of 56-mer Oligonucleotide from saspB Gene otB. anthracis with 
25 Internal Mass Standard 

ESI-TOF MS spectra were obtained on a synthetic 56-mer oligonucleotide (5 pM) from 
the saspB gene of B. anthracis containing an internal mass standard at an ESI of 1 .7 pl/min as a 
frmction of sample consumption. The results (Fig. 11) show that the signal to noise is improved 
as more scans are summed, and that ibs standard and the product are visible after only 100 scans. 

30 

Example 10: ESI-TOF MS of an Internal Standard with Tribntylammonium CTBA)- 
trifluoroacetate (TFA) Buffer 
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An ESI-TOF-MS spectmin of a 20-iner phosphorothioate mass standard was obtained 
following addition of 5 mM TBA-TFA.buff» to the solution. Hiis buffer strips charge from Ihe 
oligonucleotide and shifts &e most abundant diarge state fiom [M-SlT]^ to [M-Slf]^ (Fig. 12). 

5 Example II: Master Database Comparison 

The molecular masses obtained througji Examples 1-10 are compared to molecular 
masses of known bioagents stored in a master database to obtain a high probability matching 
molecular mass. 



10 Example 12: Master Data Base Interrogatton over the Internet 

The same procedure as in Example 11 is followed except that the local computer did not 
store die Master database. The Master database is interrogated over an intemet connection, 
searching for a molecular mass matdi. 

IS Example 13: Master Database Updating 

The same procedure as in example 1 1 is followed except the local computer is 
connected to the intemet and has the ability to store a master database locally. The local 
computer system periodically, or at the iiser's discretion, interrogates the Master database, 
synchronizing the local master database with the global Master database. Hiis provides the 

20 current molecular mass information to both the local database as well as to 'die global Master 
database. This fiirther provides more of a globalized knowledge base. 



Example 14: Global Database Updating 

The same procedure as in example 1 3 is followed except there are numerous such local 
25 stations throughout the world. The synchronization of each database adds to the diversity of 
information and diversity of the molecular masses of known bioagents. 



Example 15: Demonstration of Detection and Identification of Five Species of Bacteria in a 
Mixture 

30 Broad range intelligent primers were chosen following analysis of a large collection of 

cirtated bacterfel 16S rRNA sequences rqpresentmg greiater than 4000 iqiecies of bacteria. 
Examples of primers capable of pruning fam greater dian 90% of the organisms in the 
collection include, but are not limited to, ^ose exhibited in Table 8 M^erein Tp S'propynylated 
uridine and Cp = 5*propynylated cytidine. 
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Tables 



Intelligent Primer Pairs for Identification of Bacteria 



Primer 
Pair Nasie 


Forward Primer 
Sequence 


Forward 
SEQ ID 
NO: 


Reverse Primer 
Seepience 


Reverse 
SEQ ZD 
NO: 


16S EC 107 
7_1195 


3TGAGATGTTGGGTTAAGTCCC 
GTAflCGAG 








16S EC LOS 
2 1197 


^TGTTGGGTTAAGTCCCGCAAC 
GAG 




TTGACGTCATCCCCACCTTCCT 
C 




16S EC 109 
0 1196 


TTAAGTCCCGCAACGATCGCAA 




TGACGTCATCCCCACCTTCCTC 




16S EC 122 
2 1323 


GCTACACACGTGCTACAATG 




3GAGTT GCAGACTGCGAT CCG 




16S EC 133 
2 1407 


AAGTCGGAATCGCTAGTAATCG 




jACGGGCGGTGTGTACAAG 




16S EC 30 
126 


TGAACGCTGGTGGCATGCTTAA 

CAC 




TACGCATTACTCACCCGTCCGC 




16S EC 38 
"120 


GTGGCATGCCTAATACATGCAA 
GTCG 


20 


TTACTCACCCGTCCGCCGCT 


21 


les EC 49 
120* 


TT^CACATGCAAGTCGAACG 




TTACTCACCCGTCCGCC 




16S EC 663 
795 


6TGTAGCGGTGAAATGCG 




GTATCTAATCCTGTTTGCTCCC 




16S EC 713 
"809 


AGAACACCGATGGCGAAGGC 




CGTGGACTACCAGGGTATCTA 




16S EC 785 
897 


6GATTAGAGACCCTGGTAGTCC 




GGCCGTACTCCCCAGGCG 




16S EC 785 
_897_2 


GGATTAGATACCCTGGTAGTCC 
ACGC 




GGCCGTACTCCC CAGGCG 




16S EC 789 
894 


TAGATACCCTGGTAGTCCACGC 


32 


CGTACTCCCCAGGCG 


33 


16S EC 960 
1073 


TT C GAT GCAACGCGAAGAACCT 








16S EC 969 
1078 


ACGCGAAGAACCTTAOC 


36 


ACGACACGAGCTGACGAC 


37 


23S EC 182 
6 1924 


CTGACACCTGCCCGGTGC 


38 


GACCGTTATAGTTACGGCC 


39 


23S EC 264 
5 2761 


TCTGTCCCTAGTACGAGAGGAC 
CGG 


40 


TGCTTAGATGCTTTCAGC 


41 


23S_EC_264 
5 2767 


CTGTCCCTAGTACGAGAGGACC 


42 


GTTTCATGCTTAGATGCTTTCA 


43 


23S EC 493 

571 


G6GGAGTGAAAGAGATCCTGAA 
ACCG 


44 


ACAAAA6GTACGCCGTCACCC 


45 


23S EC 493 
_571_2 


GGGGAGTGAAAGAGATCCTGAA 
ACCG 


46 


ACAAAAGGCACGCCATCACCC 


47 


23S EC 971 
1077 


CGAGAGGGAAACAACCCAGACC 


48 


TGGCTGCTTCTAAGCCAAC 


49 


INFB EC 13 
65_1467 


TGCTCGTGGTGCACflaGTAACG 
GATATTA 


50 


TGCTGCTTTCGCATGGTTAATT 
GCTTCAA 


51 


RPQC EC 10 
18_1124 


CAAAACTTATTAG6XAAGCGTG 
TTGACT 


.52 


TCAAGCGCCATTTCTTTT.GGTA 
AACCACAT 


-. -53. _ 


RPOC EC 10 
18_1124_2 


CAAAACTTATTAGGTAAGCGTG 
TTGACT 


54 


TCAAGCGCCATCTCTTTCGGTA 
ATCCACAT 


55 


RPOC EC 11 
4_232 


TAAGAAGCCGGAAACCATCftAC 
TACCG 


56 


GGCGCTTGTACTTACCGCAC 


57 
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EPOC EC 21 
78 2246 


^GATTCTGGTGCCCGTGGT 




'TGGCCATCAGGCCACGCATAC 


59 


RPOC EC 21 
78_2246_2 


rGATTCCGGTGCCCGTGGT 


€0 


rTGGCCATCAGACCACGCATAC 


61 


RPOC EC 22 
18 2337 


CTGGCAGGTATGCGTGGTCTGA 

rG 


62 


CGCACCGTGGGTTGAGATGAAG 

lAC 


63 


RPOC EC 22 
18_2337_2 


CTTGCTGGTATGCGTGGTCTGA 
TG 


64 


CGCACCATSCGTAGA6ATGAAG 
TAG 


65 


RPOC EC 80 
8^88 9 


CGTCG6GTGATTAACCGTAACA 

ACCG 


66 


GTTTTTCGTTGCGTACGATGAT 
GTC 


67 


RPOC EC 80 
8_891 


CGTCGTGTAATTAACCGTAACA 
ACCG 


68 


ATGCT 


69 


RPOC EC 99 
3_1059 


3^AAGGTAAGCAAGGTC6TTTC 
CGTCA 




]!GAACGGCCTGAGTAGTCAAVCA 
CG 




RPOC EC 99 
3_i059_2 


CAAAGGTAAGCAAGGACGTTTC 
CGTCA 


72 


CGAACGGOCAGAGTAGTCAACA 
CG 


73 


TOFB EC 23 
9 303 


TAGACTGCCCAGGACACGCTG 


74 


GCCGTCCATCTGAGCAGCACC 


75 


TOFB EC 23 
9 303 2 


TTGACTGCCCAGGTCACGCTG 




3CCGTCCAT TTGAGCAGC ACC 




TOFB EC 97 
6 1068 


AACTACCGTCCGCAGTTCTACT 
TCC 


78 


GTTGTCGCCAGGCATAACCATT 
TC 


79 


TUFB EC 97 
6_1068_2 


AACTACCGTCCTCAGTTCTACT 

TCC 


80 


GTTGTCACCAGGCATTACCATT 
TC 


81 


TOFB EC 98 
510 62 


CCACAGTTCTACTTCCGTACTA 
CTGACG 


82 


TCCAGGCATTACCATTTCTACT 
CCTTCTGG 


83 


RPLB EC 65 

ojr62" 


GACCTACAGTAAGA66TTCTGT 
AATGAACC 


84 


TCCAAGTGCTGGTTTACCCCAT 
GG 


85 


RPLB EC 68 
8 "757" 


CATCCACACGGTGGTGGTGAAG 

G 


86 


GTGCTGGTTTACCCCATGGAGT 


87 


RPOC EC 10 
36~1126 


CGTGTTGACTATTCGGGGCGTT 

CAG 


88 


ATTCAAGAGCCATTTCTTTTGG 
TAAACCAC 


89 


RPOB EC 37 
62_3865 


TCAACAACCTCTTGGAGGTAAA 
GCTCAGT . 




TTTCTTGAAGAGTATGAGCTGC 
TCCGTAAG 




RPLB EC 68 
8_771 


CATCCACACGGTGGTGGTGAAG 
G 




TGTTTTGTATCCAAGTGCTGGT 
TTACCCC 




VALS EC 11 
05 1218 


CGTGGCGGCGTGGTTATCGA 




CGGTACGAACTGGATGTCGCCG 
TT 




RPOB EC 18 
45 1929 


TATCGCTCAGGCGAACTCCAAC 


96 


GCTGGATTCGCCTITGCTACG 


97 


RPLB EC 66 
9_761 


TGTAATGAACCCTAAT GACCAT 
CCACACGG 


98 


CCAAGTGCTGGTTTACCCCATG 
GAGTA 


99 


RPLB EC 67 
1_762 


TAATGAACCCTAATGACCATCC 
ACACGGTG 




TCCAAGTGCTGGTTTACCCCAT 
GGAG 




RPOB EC 37 
75_3858 


CTT GGAGGTAAGTCTCATTTTG 
GTGGGCA 




TGTAATGC 




VALS EC 18 
33 19r3 


CGAC6CGCTGCGCTTCAC 




GCGTTCCACAGCTTGTTGCAGA 
AG 




RPOB EC 13 
35 1455 


GACCACCTGGGCAACCGT 


lOG 


TT-CGCTCTCGGCCTGGCC 


107 


TUFB EC 22 
5_309 


GCACTATGCACACGTAGATTGT 
CCTGG 


108 


TATAGCACCATCCATCTGAGCG 
GCAC 


109 


DNAK EC 42 
B~522 


CGGCGTACTTCAACGACAGCCA 


110 


CGCGGTCGGCTCGTTGATGA 


111 
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VALS EC 19 ( 
20 1970 


:ttctgcaacaagctgtggaac 

3C 


112 


PCGCAGTTCATCAGCACGAAGC 


113 


TUFB EC 75 
7 857 


'^GACGACCTGCACGGGC 






115 


23S EC 264 
6 2765 


:TGTTCTTAGTACGA6AGGACC 




'TCGTGCTTAGATGCTTTCAG 




16S EC 969 
_1078_3P 


iVCGCGAAGAACCTTACpC 




LCGACACGAGCpTpGACGAC 




16S EC 972 
_1075_4P 


3GAAGAACpCpTTACC 


120 


\CACGAGCpTpGAC 


121 


16S EC 972 
1075 


CGAAGAACCTTACC 


122 


"iCACGAGCTGAC 


123 


23S EC_- 
347 59 


CCTGATAAG6GTGAGGTCG 


124 


ACGTCCTTCATCGCCTCTGA 


125 


23S EC - 
7 450 


GTTGTGA6GTTAAGCGACTAAG 


126 


CTATCGGTCAGTCAGGAGTAT 


127 


23S EC - 
7 910 


GTTGTGA66TTAAGCGACTAA6 


128 


TTGCRTCGGGTTGGTAAGTC 


129 


23S EC 430 
~1442 


ATACTCCTGACTGACCGATAG 


130 


AACATAGCCTTCTCCGTCC 


131 


23S EC 891 
"1931 


GACTTACCAACCCGATGCAA 


132 


TACCTTAGGACCGTTATAGTTA 
CG 


133 


23S EC 142 
4 2494 


GGACGGAGAAGGCTATGTT 


134 


CCAAACACCGCCGTCGATAT 


135 


23S EC 190 
8 2852 


CGTAACTATAACGGTCCTAAGG 
TA 


136 


GCTTACACACCCGGCCTATC 


137 


23S_EC_247 
5 3209 


ATATCGACGGCGGTGTTTGG 


138 


GCGTGACAGGCAGGTATTC 


139 


16S EC - 
60 525 


AGTCTCAAGAGTGAACACGTAA 


140 


GCTGCTGGCACGGAGTTA 


141 


16S EC 326 
1058 


GACACGGTCCAGACTCCTAC 


142 


CCATGCAGCACCTGTCTC 


143 


16S EC 705 
1512 


GATCTGGAGGAATACCGGTG 


144 


ACGGTTACCTTGTTACGACT 


145 


16S EC 126 
8 1775 


GAGAGCAAGCGGACCTCArA 


146 


CCTCCTGCGTGCAAAGC 


147 


GROL EC 94 
1 1060 


TGGAAGATCTGGGTCAGGC 


148 


CAATCTGCTGACGGATCTGAGC 


149 


INFB EC 11 
03 1191 


GTCGTGAAAACGAGCTGGAAGA 


150 


CATGATGGTCACAACCGG 


151 


HFLB EC 10 
82 1168 


TGGCGAACCTGGTGAACGAAGC 


152 


CTTTCGCTTTCTCGAACTCAAC 
CAT 


153 


INFB EC 19 
6 9^0 58 


CGTCAGGGTAAATTCCGTGAAG 
TTAA 


154 


AACTrCGCCTTCGGTCATGTT 


155 


GROL EC 21 
9 "350" 


GGTGAAAGAAGTTGCCTCTAAA 
GC 


156 


TTCAGGTCCATCGGGTTCATGC 
C 


157 


VALS EC 11 
05~1214 


CGTGGCGGCGTGGTTATCGA 


158 


ACGAACTGGATGTCGCCGTT 


159 


16S EC 556 
700 


CGGAATTACTGGGCGTAAAG 


160 


CGCATTTCACCGCTACAC 


161 


RPOC EC 12 
56 1315 


ACCCAGTGCTGCTGAACCGTGC 


162 


GTTCAAATGCCTGGATACCCA 


163 


16S EC 774 
894 


GGGAGCAAACAtSGATTAGATAC 


164 


CGTACTCCCCAGGCG . " 


■ 165 


RPOC EC 15 
84 1643 


TGGCCCGAAAGAAGCTGAGCG 


166 


ACGCGGGCATGCAGAGATGCC 


167 


16S EC 108 
2 1196 


ATGTTG6GTTAAGTCCCGC 


168 


TGACGTCATCCCCACCTTCC 


169 
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16S EC 138 
9 1541 


CTTGTACACACCGCCCGrC 


170 


AAGGAGGTGATCCAGCC 


171 


16S EC 130 
3~1407 


CGGATTGGAGTCTGCftACTCG 


172 


GACGGGCGGTGTGTACAAG 


173 


23S EC 23 
~130 


GGTGGATGCCTTGGC 


174 


GGGTTTCCCCATTCG6 


175 


23S EC 187 
256 


GGGAACTGAAACATCTAAGTA 


176 


TTCGCTCGCCGCTAC 


177 


23S EC 160 
2 1703 


TACCCCAAACCGACACAGG 


178 


CCTTCTCCCGAAGTTACG 


179 


23S EC 168 
5 1842 


CCGTAACTTCGGGAGAAGG 


180 


CACCGGGCA6GCGTC 


181 


23S EC 182 
7 1949 


GACGCCTGCCCGGTGC 


182 


CCGACAA6GAATTTCGCTACC 


183 


23S EC 243 
4~2511 


AAGGTACTCCGGGGATAACAGG 
C 


184 


AGCCGACATCGAGGTGCCAAAC 


185 


23S EC 259 
9~2669 


GACAGTTC6GTCCCTATC 


186 


CCGGTCCTCTCGTACTA 


187 . 


23S EC 265 
3~2758 


TAGTACGAGAGGACCGG 


188 


TTAGATGCTTTCAGCACTTATC 


189 


23S BS - 
68 21 


AAACTAGATAACAGTAGACATC 
AC 


190 


GTGCGCCCTTTCTAACTT 


191 


16S EC 8 3 
~58 


AGAGTTT6ATCATGGCTCAG 


192 


ACTGCTGCCTCCCGTAG 


193 


16S EC 314 
"575 


CACTGGAACTGAGACACGG 


194 


CTTTACGCCCAGTAATTCCG 


195 


16S EC 518 
795 


CCAGCAGCCGCGGTAATAC 


196 




197 


16S EC 683 
~985 


GTGTAGCGGTGAAATGCG 


193 


GGTAAGGTTCTTCGCGTTG 


199 


16S EC 937 
"1240 


AAGCGGTGGAGCATGTGG 


200 


ATTGTAGCACGTGTGTAGCCC 


201 


16S EC 119 
5 1541 


CAAGTCATCATGGCCCTTA 


202 


AAGGAGGTGATCCAGCC 


203 


16S EC 8 1 
541 


AGAGTTTGATCATGGCTCAG 


204 


AAGGAGGTGATCCAGCC 


205 


23S EC 183 
1 1936 


ACCTGCCCAGTGCTGGftAG 


206 


TCGCTACCTTAGGACCGT 


207 


16S EC 138 
7 1513 


GCCTTGTACACACCTCCCGTC 


208 


CACGGCTACCTTGTTACGAC 


209 


16S EC 139 
0 1505 


TTGTACACACCGCCCGTCATAC 


210 


CCTTGTTACGACTTCACCCC 


211 


16S EC 136 
7~1506 


TACGGTGAATACGTTCCCGGG 


212 


ACCTTGTTACGACTTCACCCCA 


213 


16S EC 804 
929 


ACCACGCCGTAAAC6ATGA 


214 


CCCCCCTCAATTCCTTTGAGT 


215 


16S EC 791 
904 


GATACCCTGGTAGTCCACACCG 


216 


GCCTTGCGACCGTACTCCC 


217 


16S EC 789 
899 


TAGATACCCTGGTAGTCCACGC 


218 


GCGACCGTACTCCCCAGG 


219 


16S_EC_109 


TAGTCCCGCAACGAGCGC 


220 


GACGTCATCCCCACCXTCCTCC 


221 


23S EC 258 
6"2677 


TAGAACGTCGCGAGACAGTTCG 


222 


AGTCCATCCCGGTCCTCTCG 


223 


HEXftMER EC 
61 362 


GAGGAAAGTCCGGGCTC 


224 


ATAAGCCGGGTTCTGTCG 


225 


RNASEP BS 
43 384 


GAGGAAAGTCCATGCTCGC 


226 


GTAAGCCATGTTTTGTTCCATC 


227 


RNASEP EC 


GAGGAAAGTCCGGGCTC 


228 


ATAAGCCGGGTTCTGTCG 


229 
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61 362 










YAED_TRNA_ 

ALA- 
RRNH EC 51 

3 49 


GCGGGATCCTCTAGA6GTGTTA 
AATAGCCTGGCAG 


230 


GCGGGATCCTCTAGAAGACCTC 
CTGCGTGCAAAGC 


231 


RNASEP SA 
31 379 


GAGGAAAGTCCATGCTCAC 


232 


ATAAGCCATGTTCTGTTCCATC 


233 


16S EC 108 
2 1541 


ATGTTGGGTTAAGTCCCGC 


234 


AAGGAGGTGATCCAGCC 


235 


16S EC 556 
795 


CGGAATTACTGGGCGTAAAG 


236 


GTATCTAATCCTGTTTGCTCCC 


237 


16S EC 108 
2_1196_10G 


ATGTTGGGTTAAGTCCCGC 


238 


TGACGTCATGCCCACCTTCC 


239 


15S EC 108 
2 1196 lOG 
IIG 


ATGTTGGGTTAAGTCCCGC 


240 


TGACGTCATGGCCACCTTCC 


241 


TRNA ILERR 
NH ASPRRNH 
EC 32 41 


GCGGGATCCTCTAGACCTGATA 
AGGGTGA6GTCG 


242 


GC6GGATCCTCTAGAGCGTGAC 
AGGCAGGTATTC 


243 


16S EC 969 
1407 


ACGCGAAGAACCTtACC 


244 


GACGGGCGGTGTGTACAAG 


245 


les EC 683 
~1323 


GTGTAGCGGTGAAATGCG 


246 


CGAGTTGCAGACTGCGATCCG 


247 


16S EC 49 
894 


TAACACATGCAAGTCGAACG 


248 


CGTACTCCCCAGGCG 


249 


16S EC 49 
1078 


TAACACATGCAAGTCGAACG 


250 


AC6ACAC6AGCTGACGAC 


251 


CYA BA 134 
9 1447 


ACAACGAAGTACaUVTACAAGAC 


252 


CTTCTACATTTTTAGCCATCAC 


253 


16S EC 109 
0_1196_2 


TTAAGTCCCGCAACGAGCGCAA 


254 


TGACGTCATCCCCACCTTCCTC 




16S EC 405 
_527 


TGAGTGATGAAGGCCTTAGGGT 
TGTAAA 


256 


CGGCTGCTGGCACGAAGTTAG 


257 


GROL EC 49 
6 596 


ATGGACAAGGTTGGCAAG6AAG 
G 


258 


TAGCCGC6GTCGAATTGCAT 


259 


GROL EC 51 
i_593 


AAGGAAGGCGTGATCACCGTTG 
AAGA 


260 


CCGCGGTCGAATTGCATGCCTT 
C 


261 


VALS EC 18 
35 1928 


ACGCGCTGCGCTTCAC 


262 


TTGCA6AAGTTGCGGTAGCC 


263 


RPOB EC 13 
34 1478 


TCGACCACCTGGGCAACC 


264 


ATCAG6TCGTGCGGCATCA 


265 


DNAK EC 42 
0 521 


CACGGTGCCGGCGTACT 


266 


GCGGTCGGCTCGTTGATGAT 


267 


RPOB EC 37 
76 3853 


TTGGAGGTAAGTCTCATTTTGG 
TGG 


268 


AAGCTGCACCATAAGCTTGTAA 
TQC 


269 


RPOB EC 38 
02~3885 


CA6CGTTTCGGCGAAATGGA 


270 


CGACTTGACGGTTAACATTTCC 
TG 


271 


RPOB EC 37 
99_3888 


GGGCAGCGTTTCGGCGAAATGG 
A 


272 


6TCCGACTTGAC6GTCAACATT 
TCCTG 


273 


RPOC EC 21 
46_2245 


CAGGAGTCGTTCAACTC6ATCT 
ACATGAT 


274 


ACGCCATCAGGCCACGCAT 


275 


ASPS EC -40 
"5 538 




276 • 


AGG6CACGAGGTAGTCGC • 


• 277 


RPOC EC 13 
74~1455 


CGCCGACTTCGACGGTGACC 


278 


GAGCATCAGCGT6CGTGCT 


279 


TOFB EC 95 
7 1058 


CCACACGCCGTTCTTCAACAAC 
T 


280 


GGCATCACCATTTCCTTGTCCT 
TCG 


281 
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163 EC 7 1 
22 


6A6A6TTTGATCCTG6CTCAGA 
ACGAA 


282 


TGTTACTCACCCGTCTGCCACT 


283 


VALS EC 51 
0"727 


ACCGAGCAA6GAGACCAGC 


284 


TATAACGCACATCGTCAGGGTG 
A 


285 



For evaluation in the laboratory, five species of bacteria were selected including three y- 
proteobacteria (E. coli, K. pneumoniae and P. auergiosa) and two low G+C gram positive 
bacteria (B. siibtilitis and S. aureus). The identities of the organisms were not revealed to the 
5 laboratory technicians. 

Bacteria were grown in culture, DNA was isolated and processed, and PCR performed 
using standard protocols. Following PCR, all samples were desalted, concentrated, and analyzed 
by Fourier Transform Ion Cyclotron Resonance (FTICR) mass spectrometry. Due to the 
extremely high precision of the FTICR, masses could be measured to within 1 Da and 

10 unambiguously deconvoluted to a single base composition. The measured base compositions 
were compared with the known base composition signatures in our database. As expected viien 
using broad range survey 16S primers, several phylogenetic near-neighbor organisms were 
^fEuHilt to distmgoish fiom our test organisms. Additional non-ribosomal prim^ were used to 
triangulate and fiirther resolve fliese clusters. 

IS Aa example of the use of primers directed to regions of KNA polymerase B (rpoB) is 

shown in Figure 19. This gene has the potential to pro^nde broad priming and resolving 
capabilities. A pair of primers directed against a conserved region of ipoB provided distinct base 
composition signatures that helped resolve the tight mterobacteiiae cluster. Joint probability 
estimates of it& signatures &om each of the primers resulted in the identification of a single 

20 organism fiiat matched the identity of the test sample. Therefore a combmation of a small 
number of primers that amplify selected regions of tide 16S ribosomal RNA gene and a few 
additional primers that amplify selected regions of protein encoding genes provide sufGcient 
information to detect and identify all bacterial patiiogens. 

25 Example 16: Detection oi Staphylococcus aureus in Blood Samples 

Blood samples in an analysis plate were spiked with genomic DNA equivalent of 10^ 
organisms/ml of Staphylococctis aureus. A single set of 16S rRNA primers was used for 
. amplification. Following PCR, all samples were desalted, concentrated, and analyzed by Fourier 

Transform Ion Cyclotron Resonance (FTICR) mass spectrometry. In each of the spiked wells, 
30 strong signals were detected vt^ch are consistent with the expected DCS of the S. aureus 
amplicon (Figure 20). Furth«more, tiiere was no robotic carryover or contamuiation m any of 
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flie blood only or water blank wells. Methods similar to this one will be applied for other 
clinically relevant samples including, but not limited to: urine and throat or nasal swabs. 

Example 17: Detection and Serotyping of Viruses 
5 The virus detection capability of die present invention was demonstrated in 

collaboration with Naval health officers using adenoviruses as an example. 

All available genomic sequences for human adenovmises available in public databases 

were surveyed. Hie hexon gene was identified as a candidate likely to have broad specificity 

across all serotypes. Four primer pairs were selected from a groiq) of primers designed to yield 
10 broad coverage across the majority of the adenoviral strwn types (Table 9)|wherem Tp = 

5'propynylated uridine and Cp = 5'propynylated cytidine. 

Table 9 



Intelligent Primer Pairs for Serotyping of Adenoviruses 



Primer Pair 


rorvrard. Primer 
Sequence 


SBQ ID 
KO: 


Sequence 


SBQ ZD 
NO: 


HEX HAD7+4+2 
1 934 995 


AGACCCAATTACATTGGCTT 


286 


CCAGTGCTGTTGTAGTACAT 


287 


HEX HAD7+4+2 
1 976 105O 


ATGTACTACAACAGTACTGG 


288 


CAAGTCAACCACAGCATTCA 


289 


HEX HAD7+4+2 
1 970 1059 


GGGCTTATGTACTACAACAG 


290 


TCTGTCTTGCAAGTCAACCAC 


291 


HEX HAD7+3 7 
71 827 


GSAATTTTTTGATGCTAGAGA 


292 


TAAAGCACAATrrCAGGCG 


293 


HEX HAD4+16 
746 848 


TAGATCTGGCTTTCTTTGAC 


294 


ATATGAGTATCTGGAGTCTGC 


295 


HEX HAD7 509 
578 


GGAAAGACATTACTGCAGACA 


296 


CCAACTTGAGGCTCTGGCTG 


297 


HEX BAD4 121 
6 1289 


ACAGACACTTACCAGGGTG 


298 


ACTGTGGTGTC3VTCTTTGTC 


299 


HEX iiAD21 51 
5 567 


TCyVCTAAAGACAAAGGTCTTCC 


300 


GGCTTCGCCGTCTGTAATTTC 


301 


HEX HAD 1342 
1469 


CGGATCCAAGCTAATCTTTGG 


302 


GGTATGTACTCATAGGTGTTG 
GTG 


303 


HEX HAD7+4+2 
1 934 995P 


AGACpCpCAATTpACpATpXGG 
CTT 


304 


CpCpAGTGCTGTpTpGTAGTA 
CAT 


305 


HEX HAD7+4+2 
1 976 1050P 


ATpGTpACTpACAACAGTACpT 
pGG 


306 


CAAGTpCpAACCACAGCATpT 
pCA 


307 


HEX HAD7-I-4+2 
1 970 1059P 


GGGCpTpTATpGTpACTACAAC 
pAG 


308 


TCTGTpCpTTGCAAGTpCpAA 
CCAC 


309 


HEX HAD7+3 7 
71 827P ■■ ■ 


GGAATTpTpTpTpTGATGGTAG 

AGA ■ ■ . " '■ 


310 


TAAAGCACAATpTpTpCpAGG 

CG • - 


311 


HEX HAD4-H6_ 
746 848P 


TAGATCTGGCTpTpTpCpTTTG 
AC 


312 


ATATGAGTATpCpTpGGAGTp 
CpTGC 


313 


HEX HAD 1342 
14 69P 


CGGATpCCAAG(^TAATCpTpT 
TGG 


314 


GGTATGTACTCATAGGTGTpT 
pGGTG 


315 


HEX HAD7+2H- 


AACAGACCCAATTACATTGGCT 


316 


GAGGCACTTGTATGTGGAAAG 


317 
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3 931 1645 


T 




G 




HEX HAD4+2 9 
25 1469 


ATGCCTAACAGACCCAATTACA 
T 


318 


TTCATGTAGTCGTAGGTGTTG 
G 


319 


HEX HAD7+21+ 
3 384 953 


CGCGCCTAATACATCTCAGTGG 
AT 


320 


AAGCCAATGTAATTGGGTCTG 
TT 


321 


HEX HAD4+2 3 
45 947 


CTACTCTGGCACTGCCTACAAC 


322 


ATGTAATTGGGTCTGTTAGGC 
AT 


323 


HEX HAD2 772 
865 


CAATCCGTTCTGGTTCCGGATG 
AA 


324 


CTT6CC6GTCGTTCAAAGAG6 
TAG 


325 


HEX HAD7+4+2 
1 73 179 




326 


CGGTCGGTGGTCACATC 


327 


HEX HAD7+4+2 
1 1~54 


ATGGCCACCCCATCGATG 


328 


CTGTCCGGCGATGTGCATG 


329 


HEX HAD7+4+2 
1 1612 1718 


GGTCGTTATGTGCCTTTCCACA 
T 


330 


TCCTTTCTGAAGTTCCACTCA 
TAGG 


331 


HEX HAD7-f4-i-2 
1 2276 2368 


ACAACATTGGCTACCAGGGCTT 


332 


CCTGCCTGCTCATAGGCTGGA 
A6TT 


333 



These primers also served to clearly distinguish those strains responsible for most 
disease (types 3, 4, 7 and 21) firom all others. DNA isolated from field samples known to contain 
adenoviruses were tested using die hexon gene PGR primers, which provided unambiguous 
5 stnun identification for all samples. A single sample was found to contain a mixture of two viral 
DNAs belonging to strains 7 and 21. 

Test results (Figure 21) showed perfect concordance between predicted and observed 
base composition signatures for each of diese samples. Classical serotyping results confirmed 
each of these observations. Processing of viral samples directly fcom collection material such as 
10 throat swabs radier tiian Scorn isolated DNA, will result in a significant increase in througIq}ut, 
eliminating the need for virus culture. 

Example 18: Broad Rapid Detection and Strain Typing of Respiratory Pathogens for 
Epidemic SurveUlance 

1 5 Genome preparation: Genomic materials tcom culture samples or swabs were prepared 

using a modified robotic protocol using DNeasy™ 96 Tissue Kit, Qiagen). Cultures of 
Streptococcus pyogenes were pelleted and transferred to a 1 .5 mL mbe containing 0.45 g of 0.7 
mm Zirconia beads (Biospec Products, Inc.). Cells were lysed by shaking for 10 minutes at a 
speed of 1 9 l/s using a MM300 Vibration Mill (Retsch, Germany). Ibe samples were 

20 centrifiiged for 5 min and the supematants transferred to deep well blocks and processed using 

the manufacture's protocol and a Qiagen 8000 BioRobot. - -■ 

PCR: PGR reactions were assembled using a Packard MPE liquid handling platform 
and were performed in SO )iL volume using 1.8 units each of Platinum Taq (Invitrogen) and 
Hotstart PFU Turbo (Stratagene) polymerases. Cycling was performed on a DNA Engine Dyad 
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(MJ Research) with cyding conditions consisting of an initial 2 min at 9S**C followed by 45 
cycles of 20 s at 95°C, 1 5 s at 58°C, and 15 s at 72'C. 

Broad-range primers: PGR primer design for base composition analysis fmm precise 
mass measurements is constrained by an vippa limit y^re ionization and accurate 
5 deconvolution can be achieved. Currently, this limit is approximately 140 base pairs. Primers 
designed to broadly conserved regions of bacterial ribosomal RNAs (16 and 23S) and the gene 
encoding ribosomal protein L3 (rpoC) are shown in Table 10. 



Table 10 
Broad Range Primer Pairs 



Target 
Gene 


Direction 


Primer 


SEQ ID 
NO 


Length of 
Anplicon 


16S_1 


F 


GGATT7VGAGACCCTGGTAGTCC 


334 


116 


16S_1 


R 


GGCCGTACTCCCCAGGCG 


335 


116 


16S_2 


F 


TTCGATGCAACGCGAAGAACCT 


335 


115 


16S_2 


R 


ACGAGCTGACGACAGCCATG 


337 


115 


23S 


F 


TCT6TCCCTAGTACGAGAGGACCGG 


338 


118 


23S 


R 


TGCTTAGATGCTTTCAGC 


339 


118 


rpoC 


F 


CTGGCAGGTATGC6TGGTCTGATG 


340 


121 


rpoC 


R 


CGCACC6TGGGTTGAGATGAAGTAC 


341 


121 



10 

Emm-typing primers: The allelic projBle of a GAS strain by MuWlocus Sequencing 
Technique (MLST) can be obtained by sequencing the mtemal fiagments of seven housekeeping 
genes. The nucleotide sequences for each of these housekeepuig genes, for 212 isolates of OAS 
(78 distmct emm types), are available (www.mlstnet). This corresponds to oas hundred different 

15 allelic profiles or unique sequence types, referred to by Enright et al. as STl -STIOO (Enrigjrt et 
al., hifection and Immunity, 2001, 69, 2416-2427). For each sequence type, we created a vutual 
transcript by concatenating sequences appropriate to their allelic profile fix)m each of the seven 
genes. MLST primers were designed using these sequences and were constrained to be within 
each gene loci. Twenty-four primer pairs were initially designed and tested against the sequenced 

20 GAS strain 700294. A final subset of six primer pairs Table 1 1 was chosen based on a theoretical 
calculation of minimal number of primer pairs that maximized resolution of between emm types. 



25 
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Table 11 

Drill-Down Primer Pairs Used in Determining emm-type 



Target 
Gene 


Direcbion 


Primer 


SEQ ID 
MO 


Length of 
Anpllcon 


gki 


F 


GGGGATTCAGCCATCAAAGCAGCTATTGA 


342 


116 


gki 


R 


CCAACCTTTTCCACAACA6AATCAGC 


343 


116 


gtr 


F 


CCTTACTTCGAACTATGAATCTTTTGGAA 
G 


344 


115 




R 


CCCATTTTTTCACGCATGCTGAAAATATC 






inurJ 


F 


CGCAAAAAAATCCAGCTATTA6C 


346 


118 


murl 


R 


AAACTATTTTTTTAGCTATACTCGAACAC 


347 


118 


rautS 


F 


ATGATTACAATTCAAGAAGGTCGTCACGC 


348 


121 


nuts 


R 


TTGGACCTGTAATCAGCTGAATACTGG 


349 


121 


xpt 


F 


6AT6ACTTTTTA6CTAATGGTCAGGCAGC 


350 


122 


xpt 


R 


AATCGACGACCATCTTGGAAAGATTTCTC 


351 


122 


yqiL 


P 


6CTTCA6GAATCAATGATGGAGCAG 


352 


119 


yglL 


R 


GGGTCTACACCTGCACTTGCATAAC 


353 


119 



Microbiology: GAS isolates were identified from swabs on the basis of colony 
5 morphology and beta-hemolysis on blood agar plates, gram stain characteristics, susceptibility to 
bacitracin, and positive latex agglutination reactivity with gro\ip A-specifio antiserum. 

Sequencing: Bacterial genomic DNA samples of all isolates were extracted from fieshly 
grown GAS strains by using QIAamp DNA Blood Mini Kit (Qiagen, Valencia, CA) according to 
the procedures described by Ifae manu&cture. Group A streptococcal cells were subjected to PCR 
1 0 and sequence analysts using emm-gene specific PCR as previously described (Beall et al., J. 
Clin. Micro., 1996, 34, 953-958; and Facklam et al., Emerg. Infect Dis.. 1999, 5, 247-253). 
Homology searches on DNA sequences were conducted against known emm sequences present 
in (www.cdc.gov/ncidod^iotech^mfotech_hp.html). For MLST analysis, internal ftj^ents of 
seven housekeeping genes, were amplified by PCR and analyzed as previously described 
15 (Enright et al., Infection and Immunity 2001, 69, 2416-2427). The emm-type vras determined 
from comparison to the MLST database. 

Broad Range Surv^/Drill-Down Process (100): For Streptococcus pyogenes, the 
objective was tiie identification of a signature of the virulent epidemic strain and determination 
of its emm-type. Emm-type information is useful both for treatment considerations and epidemic 
20 surveillance. A total of 5 1 throat swabs was taken botii fix>m healthy recruits and from 

hospitalized patients in December 2002, during die peak of a GAS outbreak at a military training 
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camp. Twenty-seven additional isolates from pievious infections ascribed to OAS were also 
examined. Initially, isolated colonies were «camined botii from throat culture samples and throat 
swabs directiy without the culture step. The latter path can be completed witiiin 6-12 hours 
providing information on a significant number of samples rapidly enough to be useful in 

S managing an ongoing epidemic. 

The process of broad range survey/drill-down ^00) is shown in Figure 22. A clinical 
sample such as a throat swab is IBrst obtamed from an individual ^Oi). Broad range survey 
primers are used to obtain amplificadon products from tiie cUnical sample (202) which are 
analyzed to determine a BCS (203) from which a species is identified ^04). Drill-down primers 

10 are tiien ^ployed to obtain PGR products 02OS) from which specific mformation is obtained 
about the species (such as Emm-type) (206). 

Broad Range Survey Priming: Genomic regions targeted by the broad range survey 
primers were selected for their ability to allow amplification of virtually all known species of 
bacteria and for their capability to distinguish bact^al species from each other by base 

1 5 composition analysis. Initially, four broad-range PGR target sites were selected and the primers 
were synthesized and tested. The targets mcluded universally ccoiserved regions of 16S and 23S 
rRNA, and the gene encoding ribosomal protein L3 (rpoC). 

While there was no special consideration of Streptococcus pyogenes in the selection of 
the broad range survey primers (which were optimized for distinguishing all important pathogens 

20 from each other), analysis of genomic sequences showed that the base compositions of these 
re^ons distinguished StreiHococcus pyogenes from other respiratory pathogens and normal flora, 
mcludmg closely related species ot streptococci, sttphylococci, and badlli (Figure 23). ^ 
Drill Down Priming (Emm-Typing): In order to obtain strain-specific information about 
the q)idemic, a strategy was designed to measure the base compositions of a set of fiist clock 

25 target genes to generate strain-specific signatures and sunultaneously correlate with emm-types. 
In classic MLST analysis, mtemal fragments of seven housekeeping genes (gki, gtr, murl, mutS, 
recP, Tqit, yqiL) are an^>lified, sequenced and compared to a database of previously studied 
isolates vibose emm-types have beoi determmed (Homer et al. Fundamental and Applied 
Toxicology, 1997, 36, 147). Smce the analysis enabled by die present embodiment of die present 

30 invention provides base composition data rather than sequence data, the challenge was to identify 
tiie tarjget regions-lfaat im)vide the highest resolution of specie and least ambigiioiiaemin- 
classification. The data set from Table 2 of Enright et al. (Enright et al. Infection and Immunity, 
2001, 69, 2416-2427) to bioinformatically construct an alignment of concatenated alleles of die 
seven housekeeping genes from each of 212 previously emm-typed strams, of which 101 were 



wo 2004/060278 



57 



PCT/US2003/038761 



unique sequences that repiesented 75 distinct enun-types. This alignment was then analyzed to 
determine the number and location of the q)timal primer pairs that would maximize strain 
disoimination strictly on base composition data. 

An example of assignment of BCSs of PCIL products is shown in Figure 24 v/bets PGR 
5 products obtained using the gtr primer (a drill-down «imi-^ing primer) fiom two different 
swab samples were analyzed (sample 12 - top and sample 10 - bottom). The deconvoluted ESI- 
FCTIR spectra provide accurate mass measuremoits of both strands of the PGR products, fix>m 
which a series of candidate BCSs were calculated from fhs measured mass (and within the 
measured mass uncertainty). The identification of complonentary candidate BCSs &om each 

10 strand provides a means £>r unambiguous assignment of fbe BCS of the PGR product. BCSs and 
molecular masses for each strand of the PGR product &om the two different samples are also 
shown in Figure 24. In this case, the determination of BCSs for the two samples resulted in tiie 
identification of the emm-type of Streptococcus pyogenes — sample 12 was identified as ouun- 
. type 3 and sample 10 was identified as emm-type 6. 

.15 The results of Has composition analysis using the six primer pairs, 5'-emm gene 

sequencing and MLST gene sequencing method for the GAS epidemic at a military training 
, facility are compared in Figure 25. The base composition results for the six primer pairs showed 
a perfect concordance with S'-emm gene sequencing and MLST sequencing methods. Of the 5 1 
samples taken during the peak of the epidemic, all but three bad identical compositions and 

20 corresponded to emm-type 3. The three outliers, all fiwm healtiiy individuals, probably represent 
non-epidemic strains harbored by asymptomatic carriers. Samples 52-80, which were archived 
fiom previous infectibns ftom Marines at other naval training facilities, showed a much greater 
heterogeneity of composition signatures and emm-types. 

25 Example 19: Base Composition Probability Clouds 

Figure 18 illustrates the concept of base composition probability clouds via a pseudo- 
four dimensional plot of base compositions of enterobacteria including Y. pestis, Y. 
psuedottAerculosis, S. typhimuriunt. S. tjpM. Y. enterocolitica, K coHK.lZ bdAE. colt 
OlSl'HI. In tiie plot of Figure 18, A, C and G compositions correspond to ^ x, y and z axes 

30 respectively whereas T compositions are repres^ted by the size of the sphere at fiie junction of 
tiie x, 'y and z coordinates. There is ho absolute r^uirement for Mvihg a paificulariiucleobase 
composition associated with a particular axis. For example, a plot could be designed wherein G, 
t and C conq>ositions correspond to die x, y and z axes respectively vi^ereas the A composition 
GOiiesponds to the size of the sphere at the junction of the x, y and z coordinates. Furthermore, a 
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difiTeient representation can be made of flie "pseudo fourth" dimoision i.e.: other than the size of 
ttie sphere at junction of the x, y and z coordinates. For sample, a symbol having vector 
information such as an arrow or a cone can be rotated at an angle Aat varies proportionally with 
the composition ofthenucleobase corresponding to tlwpseudo fourth dimension. Thedioiceof 

5 axes and pseudo fourth dimensional representation is typically made with flie aim of optimal 
visualization of tiie data bemg presented. 

A similar base composition probability cloud analysis has been presented for a series of 
viruses in U.S. provisional patent application Serial No. 60/4313 19, which is commonly owned 
and incorporated herein by reference m its entirety. In tiiis base composition probability cloud 

10 analysis, the closely related Dengue virus types 1-4 are clearly distinguishable fiom eadi other. 
This example is indicative of a challenging scenario for species identification based on BCS 
analysis because RNA viruses have a high mutation rate, it would be expected to be diflBcuh to 
resolve closely related species. However, as this example illustrates, BCS analysis, aided by 
base composition probability cbud analysis is capable of resolution of closely related vhal 

15 species. 

A base composition probability cloud can also be represented as a three dimensional 
plot instead of a pseudo-four dimensional plot An example of such a three dimensional plot is a 
plot of G, A and C compositions correspond to the x, y and z axes respectively, while the 
composition of T is left out of the plot. Another such example is a plot where tiie compositions 
20 of all fow nucleobases is included: G, A and C+T compositions correspond to the x, y and z axes 
respectively. As for the pseudo-four dimensional plots, the choice of axes for a three dimensional 
plot is typically made with the dm of optimal visualization of the data bdng presented. 

Example 20: Biochemical Proc^sing of Large Amplification Products for Analysis by Mass 
25 Spectrometry 

hi tile example illustrated m Figure 26, a primer pair ^ch amplifies a 986 bp region of 
tiie 168 ribosomal gene m E. coli (K12) was digested witii a mbcture of 4 restriction enzymes: 
BstNl, BmFl, Bfal, hndNcoL Figure 26(a) illustiates the complexity of flie resulting ESI- 
FTIGl mass spectrum that contains multiple charge states of muitq>le restriction fragments. 
30 Upon mass deconvolution to neutral mass, the spectrum is significantiy simplified and discrete 
■ bfigonucleotidepaurs are evident (Figure 26b). Whrabase compositiomj^ " 
masses of the restriction fragments, perfect agreement is observed for the known sequence of 
nucleotides 1-856 (Figure 26c); the batch of Ncol enzyme used in this experiment was inactive 
and resulted in a missed cleavage site and a 197-mer fragment went undetected as it is outside 
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the mass range of the mass spectrometer under the conditions employed. Interestingly however, 
both a forward and reverse strand were detected for each fragment measured (solid and dotted 
lines in, respectively) within 2 ppm of the predicted molecular weights resulting in unambiguous 
determination of the base composition of 788 nucleotides of the 985 nucleotides in the amplicon. 
5 The coverage map ofFeis redundant coverage as both 5' to 3' and 3' to 5' fiagments are detected 
for fragments covering the fust 8S6 nucleotides of tiie amplicon. 

This approach is in many ways analogous to those widely used in MS-based proteomics 
studies in which large intact proteins are digested with trypsin, or otiier proteolytic enzyme(s), 
and tile identity of the proteia is derived by comparing the measured masses of the tryptic 

10 peptides witii theoretical digests. A unique feature of this approach is that tiie precise mass 
measurements of the complementary strands of each digest product allow one to derive a de 
novo base composition toe each fragment, which can in turn be "stitohed together" to derive a 
complete base composition for the larger amplicoiL An important distinction between this 
approach and a gel-based restriction mappmg strategy is that, in addition to determination of die 

1 S length of each fragment, an unambiguous base composition of each restriction fragmra.t is 
derived. Thus, a single base substitution wifliin a fragment (^n*ich would not be resolved on a 
gel) is readily observed using this approach. Because this study was performed on a 7 Tesla ESI- 
FTICR mass spectrometer, better than 2 ppm mass measurement accuracy was obtained for all 
fiagments. Interestingly, calculation of the mass measurement accuracy required to derive 

20 unambiguous base compositions from the complementary fragments indicates that the higjiest 
mass measurement accuracy actually required is only 15 ppm for the 139 bp fragment 
(nucleotides 525-663). Most of the fragments were in the 50-70 bp size-range which would 
require mass accuracy of only ~50 ppm for unambiguous base composition detenninatioa This 
level of performance is achievable on other more compact, less ejqjensive MS platforms such as 

25 the ESI-TOF suggesting that the metiiods developed here could be widely deployed in a variety 
of diagnostic and human forensic arenas. 

This example illustrates an alternative approach to derive base compositions from larger 
PCR products. Because tiie amplicons of interest cover many strain variants, for some of which 
complete sequences are not known, each amplicon can be digested under several different 

30 enzymatic conditions to ensure tiiat a diagnostically informative region of the amplicon is not 
obscured by s "blind spof * which arises from a mutation in a restriction site. The extfcnt of 
redundancy required to confidentiy map the base composition of amplicons from different 
markers, and determine vdiich set of restriction enzymes should be employed and how they are 
most effectively used as mixtures can be determined. These parameters will be dictated by the 
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extetA to yMch fbe area of ioteiest is consorved across (he amplified legion, the compatibility of 
the various restriction enzymes with respect to digestion protocol (buffer, temperature, time) and 
the degree of coverage required to discriminate one amplicon from another. 
Example 21: Identification of members of the VinA Genus Orthopoxvirus 

5 Primer sites were identified on three essential viral genes - the DNA-dependent 

polymerase (DdDp), and two sub-units of DNA-dependent RNA polymerases A and B (DdRpA 
and DdRpB). These intelligent primers designed to identify members of the viral genus 
Orthopoxvirus are shown in Table 12 wherein Tp = 5*propynylated uridine and Cp = 
5'propynylated cytidine. 

10 Table 12 

IntelKgent Primer Pairs for Identification of members of the 



Viral Genus Orthopoxvirus 



Pzimer Pair 
Hame 


Forward Primer 
Sequdnee 


Forward 
SEQ ID 
NO: 


BeTerse Pxiaer 
Saguenoe 


Reverse 
SEQ ID 
NO: 


A25L HC00161 
1 28 127 


GTACTGAATCCGCCTAAG 


354 


6TGAATAAAGTATCGCCCTAA 
TA 


355 


AISR MC00161 
1 100 207 


GAAGTTGAACCGGGATCA 


356 


ATTATCGGTCGTTGTTAATGT 


357 


A18R NC00161 
1 1348 1445 


CTGTCTGTAGATAAACTAGGAT 
T 


358 


CGTTCTTCTCTGGAGGAT 


369 


E9L NC001611 
1119 1222 


CGATACTACGGACGC 


360 


CTTTATGAATTACTTTACATA 
T 


361 


K8R NC001611 
221 311 


CTCCTCCATCACTAGGAA 


362 


CTATAACy^TTCAAAGCTTATT 
G 


363 


A24R NC00161 
1 795 878 


CGCGArAATAGATAGTGCTAAA 
C 


364 


GCTTCCACCAGGTCATTAA 


365 


A25L NC00161 
1 28 127 P 


GTACpTpGAATpCpCpGCpCpT 
AAG 


366 


GTGAATAAAGTATpCpGCpCp 
CpTpAATA 


367 


A18H NC00161 
1 100 207P 


GAAGTpTpGAACpCpGGGATCA 


368 


ATTATCGGTpCpGTpTpGTpT 
pAATGT 


369 


A18R NC00161 
1 1348 1445P 


CTGTpCpTpGTAGATAAACpTp 
AGGATT 


370 


CGTTCpTpTpCpTpC4>TpGGA 
GGAT 


371 


E9L NC001611 
1119 1222P 


CGATACpTpACpGGACGC 


372 


CTTTATGAATpTpACpTpTpT 
pACATAT 


373 


K8R NC001611 
221 Slip 


CTpCpCpTCpCpATCACpTpAG 
GAA 


374 


CTATAACATpTpCpAAAGCpT 
pTpATTG 


375 


A24R NC00161 
1 795 878P 


CGCGATpAATpAGATAGTpGCp 
TpAAAC 


376 


GCTTCpCpACpCAGGTpCATp 
TAA 


377 



As illustrated in Figure 27, members of the Orthopoxvirus genus group can be 
15. identified, distinguished fiom one anothar, and distingiiished firom other members of the 
Poxvirus femily using a single pair of primers designed against the DdRpB gene. 

Since the primers were designed across regions of high conservation within this genus, 
the likelihood of missed detection due to sequence variations at these sites is minimized. Further, 
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none of the primers is expected to amplify o&er viruses or any other DNA, based on the data 
available in GenBank. This mefliod can be used for all femiUes of viral threat agents and is not 
limited to members of die Orthopoxvirus genus. 

S Example 22: Identification of Virases that Cause Viral Hemorrliagic Fevers 

In accordance with flie present invention an approach of broad PGR primmg across 
several different viral species is employed using conserved regions in the various viral genomes, 
amplifying a small, yet highly mformative region m these organisms, and then analy2jng the 
resultant amplicons wiA mass spectrometry and data analysis. These regions will be tested with 

10 live agents, or widi genomic constructs thereof 

Detection of RNA viruses wUl necessitate a reverse transcription (RT) step prior to the 
PGR amplification of the TIGER reporter amplicon. To maximize throughput and yield while 
muiimizing the handling of the samples, commercial one-step reverse transcription polymerase 
chain reaction (RT-PCR) kits will be evaluated for use. If necessary, a one-step RT-PCR mix 

15 using our selected DNA polymerase for tbs PGR portion of the reaction will be developed. To 
assure there is no variation in our reagent performance aU new lots of enzymes, nucleotides and 
buffers will be individually tested prior to use. 

Various modifications of the invention, in addition to those described herein, will be 
apparent to those skilled in the art from the foregoing description. Such modifications are also 

20 intended to fall within the scope of the appended claims. Each reference, web site, Genebank 
accession number, etc. cited in the present application is incorporated herein by reference in its 
entirety. The following US applications are incorporated herein by reference in their entirety: 
Serial No. 10/323,233 filed December 18, 2002, Serial No. 10/326,051 filed December 18, 2002, 
Serial No. 10/325,527 filed December 18, 2002, Serial No. 10/325,526 filed December 18, 2002, 

25 Serial No. 60/431,319 filed December 6, 2002, Serial No. 60/443,443 filed January 29, 2003, 
Serial No. 60/443,788 filed January 30, 2003, Serial No. 60/447,529 filed February 14, 2003, 
and Serial No. 60/501,926 filed September 1 1, 2003. 
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Whatisdaimedis: 

1. A method of identifying a plurality of etiologic agents of disease in an individual 
comprising die steps of: 

amplifying at least one nucleic acid molecule obtained finm a biological sample fiom 
5 the mdividual with a plurality of intelligent primers to obtain a plurality of amplification 
products corresponding to the plurality of etiologic agents; and 

determining the molecular masses of the plurality of amplification products, herein tiie 
molecular masses identify tiie plurality of etiologic agents and wherein 4ie intelligent immers are 
broad range survey primers, division-wide primers, drill-down primers, or any combination 
10 thereof. 

2. A method of claim 1 wherem identification of at least one of the plurality of etiologic 
agents is accomplished at the genus or species level, and the intelligent primers axe broad range 
survey primers, division-wide primers, or any combination thereof. 

3. A method of claim 1 wherein a subspecies characteristic of at least one of the plurality 
1 5 of etiologic agents is obtained using drill-down primers. 

4. A method of claim 3 wherein the subspecies characteristic is serotype, strain type, sub- 
strain type, sub-species type, emm-type, presence of a bioengineered gene, presence of a toxm 
gene, presence of an antibiotic resistance gene, presence of a pathogenicity island, or presence of 
a virulence factor. 

20 5. A metiiod of claim 1 wherein the molecular mass is determined by mass spectrometry. 

6. A method of claim S vdierein the mass spectrometiy is Fourier transform ion cyclotron 
resonance mass spectrometry OPTICR- MS), ion trap, quadrupole, magnetic sector, time of flight 
(TOF), Q-TOF, or triple quadrupole. 

7. A method of claim 1 wherein tiie molecular masses are used to determine the base 
25 compositions of the amplification products and wherein the base compositions identify the 

patiiogen. 

8. A method of in silica screening of intelligent primer sets for identification of a plurality 
of bioagents comprising the steps of: 

preparing a base composition probability cloud plot fix>m a plurality of base 
30 composition signatures of the plurality of bioagents generated in silico; 

inspecting the base composition probability cloud plot for overlap of clouds fix)m 
different bioagents; and 

selecting primer sets based on minimal overly of the clouds. 

9 . A method of perfOTming epidemic surveillance comprising the steps of: 
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determining tlie molecular mass of die amplification product, wherein said molecular 
mass identifies the pathogen in die biological sample. 

28. A mediod of claim 27 ^^rein tbe pathogen is a bacterium, a virus, a protozoan, a 
parasite, a mold, or a fungus. 
5 29. A method of claim 27 vt^ierein die biological sample is blood, mucus, hair, urine, 
breath, sputum, saliva, stool, nail, or tissue biopsy. 

30. A method of claim 27 herein die biological sample is otoined fiom an animal. 

31. A method of claim 30 wherein the animal is a human. 

32. A mediod of claim 27 v^erein die intelligent primers are broad range survey primers, 
10 division-wide primers, or drill-down primers. 

33. A method of claun 32 wherein identification of the pathogen is accomplished at the 
genus or species level, and v^erein the intelligent primers are broad range survey primers or 
division-wide primers. 

34. A method of claim 32 wherein a subspecies characteristic of the pathogen is obtained 
15 using drill-down pruners. 

35. A method of claim 34 wherein tbe subspecies characteristic is serotype, strain type, 
sub-strain type, sub-species type, emm-type, presence of a bioengine^»d gene, presence of a 
toxin gene, presence of an antibiotic resistance gene, presence of a pathogenicity island, or 
presence of a virulence factor. 

20 36. A method ofclaim 27 wherein the molecular mass is detemiined by mass spectrometry. 

37. A method of claun 36 wherein the mass spectrometry is Fourier transform ion cyclotron 
resonance mass spectrometry (FTICR- MS), ion txap, quadrupole, magnetic sector, time of flight 
(TOF), Q-TOF, or triple quadrupole. 

38. A mediod of claim 27 herein the intelligent primers are targeted to ribosomal RNA or 
25 housekeqiing genes. 

39. A method of claim 27 viierein die molecular mass is used to determine the base 
composition of said amplification product and wherein said base composition identifies said 
pathogen. 

40. An mtelligent primer pair wherem each member of the pair has at least 70% sequence 
30 identity with die sequence of die corresponding member of any one of the following intelligent 

primer pair sequences: SEQ ID NOs: 8:9, 10:11, 12:13, 14:15, 16:17, 18:19, 20:21, 22:23, 24:25, 
26:27, 2829, 30:31, 32-33, 34:35, 36:37, 38:39. 40:41, 42:43. 44:45, 46:47, 48:49, 50:51, 52:53, 
54:55, 56:57, 58:59, 60:61, 62:63, 64:65, 66:67. 68:69, 70:71, 72:73, 74:75, 76:77, 78:79, 80:81, 
82:83, 84:85, 86:87, 88:89, 90:91, 92:93, 94:95, 96:97, 98:99, 100:101, 102:103, 104:105, 
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106:107. 108:109, 110:111. 112:113, 114:115, 116:117, 118:119. 120:121, 122:123, 124:125. 
126:127, 128:129. 130:131, 132:133, 134:135, 136:137, 138:139. 140:141, 142:143, 144:145, 
146:147, 148:149, 150:151, 152:153. 154:155, 156:157. 158:159, 160:161, 162:163, 164:165, 
166:167, 168:169, 170:171, 172:173, 174:175, 176:177, 178:179, 180:181, 182.;183, 184:185, 

5 186:187, 188:189, 190:191, 192:193, 194:195, 196:197, 198:199, 200:201, 202:203, 204:205, 
206:207, 208:209, 210:211, 212:213, 214:215, 216:217, 218:219, 220:221, 222223, 224:225, 
226:227, 228:229, 230:231, 232:233, 234:235, 236:237, 238:239, 240:241, 242:243, 244:245, 
246:247, 248:249, 250:251, 252:253, 254:255, 256:257, 258:259, 260:261, 262:263, 264:265, 
266:267, 268:269. 270:271, 272:273, 274:275. 276:277. 278:279. 280:281, 282:283, 284.285, 

10 286:287, 288:289, 290:291, 292:293, 294:295, 296:297, 298:299, 300:301. 302:303. 304:305, 
306:307, 308:309, 310:311, 312:313, 314:315, 316:317, 318:319, 320:321, 322:323, 324:325, 
326:327, 328:329, 330:331, 332:333, 334:335, 336:337, 338:339, 340:341, 342:343, 344:345, 
346:347, 348:349, 350:351, 352:353, 354:355. 356:357, 358:359, 360:361, 362:363, 364:365, 
366:367. 368:369, 370:371, 372:373, 374=375, or 376:377. 

15 41 . The intelligent primer pair of claim 40 comprising at least one modified nucleobase. 

42. The intelligent primer pair of claim 41 v^rein the modified nucleobase is 5- 
propynylcytidine or 5-propynyluridine. 

43. - A bioagent identifjdng amplicon comprising an isolated polynucleotide of about 45 to 
about 150 nucleobases in lengfli produced by the process of amplification of nucleic acid from a 

20 bioagent with a pair of intelligent primers wherein each intelligent primer is of a length of about 
12 to about 35 nucleobases, wherein the bioagent identifying amplicon provides identifying 
information about the bioagent 

44. The bioagent identifying amplicon of claim 43 Mdwrdn each member of the pair has at 
least 70% sequence identity witii tiie sequence of tiie corresponding member of any one of the 

25 following intelligent primer pair sequences: SEQ ID NOs: 8:9, 10:1 1, 12:13, 14:15, 16:17, 18:19, 
20:21, 22:23, 24:25. 26:27, 28:29, 30:31, 32:33, 34:35. 36:37, 38:39, 40:41, 42:43, 44:45, 46:47, 
48:49, 50:51, 52:53, 54:55, 56:57, 58:59, 60:61, 62:63, 64:65, 66:67. 68:69. 70:71, 72:73. 74:75, 
76:77, 78:79, 80:81, 82:83, 84:85, 86:87, 88:89, 90:91, 92:93, 94:95, 96:97, 98:99, 100:101, 
102:103, 104:105, 106:107. 108:109, 110:111, 112:113, 114:115, 116:117, 118:119, 120:121, 

30 122:123, 124:125, 126:127, 128:129, 130:131, 132:133, 134:135, 136:137, 138:139, 140:141, 
142:143, 144:145, 146:147, 148:i49, 150:151, 152:153,' 154:155. 156:157, 158:159, 160:161, 
162:163. 164:165, 166:167, 168:169, 170:171, 172:173, 174:175, 176:177, 178:179, 180:181. 
182:183, 184:185, 186:187, 188:189, 190:191, 192:193, 194:195, 196:197, 198:199, 200:201, 
202:203, 204:205, 206:207, 208:209, 210:211, 212:213, 214:215, 216:217, 218:219, 220:221, 
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222:223, 224:225, 226:227, 228:229. 230:231, 232:233. 234:235, 236:237, 238:239, 240:241, 
242-^43, 244:245, 246:247, 248:249, 250:251, 252:253, 254.255, 256:257, 258:259, 260:261, 
262263, 264:265, 266:267, 268.269, 270:271, 272-273, 274275, 276277, 278:279, 280:281, 
282283, 284:2«5, 286:287, 288289. 290:291.. 292:293. 294295, 296:297, 298:299, 300:301; 
5 302:303, 304:305, 306:307, 308:309, 310:311. 312:313, 314:315, 316:317, 318:319, 320.321, 
322:323, 324:325, 326:327, 328:329, 330:331, 332:333, 334:335, 336:337, 338:339, 340:341, 
342:343, 344:345, 346:347, 348:349, 350:351, 352:353, 354:355, 356:357, 358:359, 360:361, 
362:363, 364:365, 366:367, 368:369, 370:371, 372:373. 374:375. or 376:377. 
45. A bioagent identifying amplicon for identification of a bacterium comprising an isolated 
10 polynucleotide of about 45 to about 150 nucleobases in length produced by tiie process of 
amplification of nucleic acid encoding ribosomal RNA fiom a bacteri^um vdth a pair of 
intelligent primers wherein each intelligent primer is of a length of about 12 to about 35 
nucleobases, v^erein the bioagent identifying amplicon provides identifying information about 
the bioagent. 

15 46. The bioagrait identifying amplicon of claim 45 vtiierdn each member of the pair has at 
least 70% sequence identity with the sequence of the corresponding member of any one of the 
following intelligent primer pair sequences: SEQIDNOs: 8:9, 10:11, 12:13, 14:15, 16:17, 18:19, 
20:21, 22:23, 24:25, 26:27, 28:29. 30:31. 32:33, 34:35, 36:37, 38:39, 40:41, 42:43, 44:45. 46:47, 
48:49, 50:51, 52:53, 54:55, 56:57, 58:59, 60:61, 62:63. 64:65, 66:67, 68:69, 70:71, 72:73, 74:75, 

20 76:77, 78:79, 80:81, 82:83, 84:85, 86:87, 88:89, 90:91, 92:93, 94:95, 96:97, 98:99, 100:101, 
102:103, 104:105, 106:107. 108:109, 110:111, 112:113. 114:115, 116:117, 118:119. 120:121, 
122:123, 124:125, 126:127, 128:129, 130:131, 132:133, 134:135. 136:137. 138:139, 140:141. 
142:143, 144:145, 146:147, 148:149. 150:151, 152:153, 154:155, 156:157, 158:159, 160:161. 
162:163, 164:165, 166:167, 168:169. 170:171, 172:173, 174:175, 176:177, 178:179, 180:181. 

25 182:183, 184:185, 186:187. 188:189, 190:191, 192:193, 194:195, 196:197, 198:199, 200:201, 
202:203, 204:205, 206.207, 208:209, 210211, 212213. 214215, 216217, 218219, 220:221, 
222:223, 224:225, 226:227, 228:229, 230:231, 232:233, 234:235, 236237, 238:239, 240:241, 
242:243. 244:245, 246:247, 248249, 250251, 252:253, 254:255, 256:257, 258259, 260:261, 
262:263, 264:265. 266:267, 268:269, 270271, 272273, 274:275, 276277. 278:279, 280:281, 

30 282:283, 284:285, 334:335, 336:337, 338:339, 340:341, 342:343, 344:345, 346:347, 348:349, 
350:351, or 352:353. ' - : . 

47. A bioagent identifying amplicon for identification of a virus comprising an isolated 
polynucleotide of about 45 to about 150 nucleobases in length produced by tbie process of 
amplification of nucleic acid encoding a viral housekeeping gene with a pah: of intelligent 
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primeis wherein each intelligent primer is of a length of about 12 to about 35 nucleobases, 
i^rein the bioagent identifying amplicon pro>ddes identifying information about the bioagent. 
48. The bioagent identifying amplicon of claim 47 wherein each membw of the pair has at 
least 70% sequence identity witti the sequence of the corresponding member of any one of the 
5 foUowmg hdelUgent primer pair sequences: SEQ ID NOs: 286:287, 288:289, 290:291, 292:293, 
294:295, 296:297. 298:299, 300:301, 302:303, 304:305, 306:307, 308:309, 310:311, 312:313, 
314:315, 316:317, 318:319, 320321, 322:323, 324:325, 326:327, 328:329, 330:331, 332:333, 
354:355, 356:357, 358:359, 360361, 362:363, 364:365, 366:367, 368:369, 370:371, 372:373, 
374:375, or 376:377. 

10 49. The method of claim 48 wherein said viral housekeeping gene is hexon, DNA- 
dependent polymerase, DNA-dependent RNA polymerase A, or DNA-dependent RNA 
polymerase B. 



wo 2004/060278 



1/38 



PCT/US2003/038761 
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FIG. lA-4 
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FIG. 16 
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FIG. 17 
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Figure 18 
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Figure 19 
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Figure 22 
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SEQUENCE LISTING 

<110> ISIS Pharmaceuticals, Inc. 
Eoker, David J. 
Griffey, Richard B, 
Sampath, Rangarajan 
Hofstadler, Steven 
McNeil, John 
Crooke, Stanley T. 
Blyn, Lawrence B. 
Ranken , Raymond 
Hall, Thomas A. 

<120> METHODS FOR RAPID IDENTIFICATION OF PATHOGENS IN HUMANS AND ANIMALS 

<130> IBIS0060-500WO (DIBIS-0044WO) 

<150> 60/431,319 
<151> 2002-12-06 

<150> 10/323,233 
<151> 2002-12-18 

<1S0> 10/326,051 
<151> 2002-12-18 

<150> 10/325,527 
<151> 2002-12-18 

<150> 10/325,526 
<151> 2002-12-18 

<150> 60/443,443 
<151> 2003-01-29 

<150> 60/443,788 
<151> 2003-01-30 

<150> 60/447,529 
<151> 2003-02-14 

<150> 60/501,926 
<151> 2003-09-11 
<160> 377 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1388 
<212> RNA 

<213> 16S rRNA Consensus Sequence 

<220> . „ 

<22l^ misc_f6aLture . .. 

<222> 1-7, 15, 22-24, 36-38, 41, 42,44, 56, 59-90, 93, 97, 98, 109, 
110, 112-116, 118-120, 123-131, 134,136, 138-144, 149-155, 161, 
162, 164-177, 182-209, 212-220, 222-225, 227, 230, 231, 235, 236, 
241, 245, 246, 253, 255-257, 260, 261-263, 257, 269, 270, 278, 279, 
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281, 282, 284, 291, 294, 301, 306, 310, 329, 330, 335, 336-338, 
344, 345, 347, 350, 351, 355, 356, 357, 361,363, 364, 371, 372, 
373, 376, 379, 382-386, 388, 389, 394-396, 398-405, 408, 411-438, 
442, 443, 445-451, 453, 454, 458-460, 465, 469, 491, 495, 496, 499, 
504-506, 511, 512, 514, 524, 526-528, 530, 534, 537-544, 546-550, 
552, 556-562, 565, 569-578, 580-586, 589-595, 597, 601-606, 609, 
612-617, 621-624, 629, 633, 636, 639, 643, 645, 646, 648, 650, 654, 
658-660, 669-674, 678-683, 689, 691, 693-696, 704, 708, 713, 734, 
737, 738, 744, 746-754, 756-758, 760-782, 784-786, 791-793, 
796-800, 815, 816, 823-825, 834, 845, 848, 857, 859, 864, 869, 875, 
877, 878, 884, 886, 894-898, 903-917, 921, 922-948, 955, 961, 972, 
973, 978, 990, 1005-1013, 1015, 1017, 1019, 1021-1029, 1031, 1033, 
1037-1043, 1049-1051, 1053, 1054, 1057-1059, 1069, 1075, 1083, 10B5, 
1089, 1094, 1096-1099, 1104, 1110, 1111, 1119-1123, 1127, 1128, 
1130, 1132, 1133, 1136, 1138, 1139-1141, 1143, 1144, 1146-1150, 
1154, 1157, 1159-1162, 1166-1170, 1172, 1173, 1176, 1181, 1183-1186, 
1195-1198, 1200, 1205, 1206, 1210, 1220-1222, 1227, 1229, 1231-1233, 
1244, 1249, 1266-1268, 1271, 1273, 1274, 1277-1285, 1288, 1289, 
1293-1304, 1306-1311, 1313-1322, 1324, 1326, 1329, 1330-1338, 1341, 
1345-1347, 1361, 1364, 1366-1368, 1372, 1373, 1375, 1378, 1388 



<223> n = A,T,C or G 
<400> 1 

nnnnnnnaga ggacnggcca gnnngaacgc 
nnnnnnnnnn nnnnnnnnnn nnnniuinnnn 
ccnnnnnnnn nggnanannn nnnngaaann 
gnnnnnnnnn nnnnnnnnnn nnnnnnnnng 
nggcnnacca agncnnngan nnnagcngnn 
nacggnccan acccacggga ggcagcagnn 
nannccgcgg nnngangang gnnnnngnng 
nnnnnnnnnn nnnnnnnnga cnnannnnnn 
cgcggaaacg nagghngcna gcgnnncgga 
nnnngnnnnn gnaaannnnn nngcnaacnn 
nnnnnnagng gnnnnnngaa nnnnggigng 
gcgaaggcnn nnoncggnnn nnnacgacnc 
gaacccggag ccangcnnaa acgngnnnnn 
nnannnaacg nnnaannnnn ccgccgggga 
ggggnccngc acaagcngng gagnagggna 
cannnnnnnn nnnnnnngan annnnnnnnn 
ngcgcagccg gnnggagngg ggaagcccgn 
nnnnnnnnng ngnaccnnnn nnnacgccnn 
aancncagnc ccangnnnng ggcncacacn 
ngnnannnnn agcnaancnn nnaaannnnn 
gaagnnggan cgcagaacgn nnacagnang 
ccgcannnca ngnnagnnnn nnnnnccnna 
nncnanggnn nnnnnnnnga ngggnnnaag 
acaccccn 

<210> 2 
<211> 2654 
<212> RNA 

<213> Artificial Sequence 
<220> 



ggcggnnngc nnanacagca agcgancgnn 60 
agnggcnnac ggggagaann cnnnnnannn 120 
nnnnnaaacc nnannnnnnn nnnnnnnaaa 180 
annnnnnnnn gnnnnanagn ngggnnggaa 240 
cgagaggnng nncngccaca nggnacgaga 300 
ggaannnnca aggnngnaan ncgannnagc 360 
aaannncnnn nnnnnganga nnnnnnnnnn 420 
nannaagnnn cggcnaacnc ggccagcagc 480 
nnangggcga aagngnnngn aggnggnnnn 540 
nnnnnnnncn nnnnnnacnn nnnnncngag 600 
ggnaancgna gananngnan gaanaccnnn 660 
nannnncgaa agcngggnag cnaacaggaa 720 
nnnngnnngn nnnnnnnnnn nnnnnnnnnn 780 
gacgnncgca agnnnaaacc aaangaagac 840 
acgangnnac gcgnanaacc accnnnnnga 900 
nnnnnnnnnn nnnnnnnnac agggngcagg 960 
aacgagcgca acccnnnnnn nnngncnanc 1020 
ngnnaannng gaggaaggng gggangacgc 1080 
ncacaaggnn nnnacanngn gnngcnannn 1140 
cnnagncgga ngnnnncgca accgnnnncn 1200 
nnncgggaaa cgcncgggnc gacacaccgc 1260 
agnnnnnnnn nnnncnnnnn ngnnnnnnnn 1320 
cgaacaagga nccnannnga anngnggngg 1380 
1388 
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<221> niisc_feature 

<222> 1-4, 8-12, 16, 18-22, 31, 35-40, 43, 47, 54, 56-62, 64, 65, 67-69, 
71, 72, 74-76, 79, 80, 83, 84, 86-93, 95-99, 101, 104-106, 108, 109, 
119, 125-142, 144-171, 173-175, 178-182, 186, 194, 198-202, 208, 214, 
215, 218-221, 224, 226-229, 233, 244-246, 248, 250, 251, 254, 256-280, 
282-284, 288-292, 295, 299-307,309-311, 314, 316, 318, 319, 322, 325-328, 
332-354, 358-360, 362-365, 367, 370, 372, 373, 375-378, 380, 381, 
385-387, 389-392, 398-403, 407, 414-420, 425, 429, 431, 433, 434, 439, 
443, 451, 458, 463, 464, 465, 466, 467, 469, 479, 482, 483, 492-496, 498, 
500, 503-505, 507-509, 512-525, 528, 529, 533, 537, 539, 540, 543-545, 
547, 549, 552-561, 564, 567, 570-578, 580, 583, 586, 589, 594, 599-601, 
604, 605, 607, 611-613, 616-620, 622-625, 630, 635, 637-639, 643, 646-648, 
651, 652, 657, 662-666, 670, 672-676, 682, 689-696, 703-708, 714, 715, 
718-720, 722, 725, 730, 731, 733, 736, 738, 742-744, 746, 747, 756, 757, 
763-766, 770-773, 776-791, 794,805-814, 817-829, 832, 833, 835-842, 847, 
852, 855-870, 872, 875, 876, 878, 879, 881-883, 885, 887, 889, 892-894, 
896-898, 900, 901, 903, 908, 913, 920, 922, 923, 925-927, 932, 936, 
939-946, 952, 956, 959, 962-967, 969, 970, 972, 976-978, 983, 999, 1001, 
1002, 1008, 1009, 1015, 1022, 1023, 1025, 1028-1034, 1039, 1042, 1043, 
1045, 1047, 1052, 1056-1063, 1069-1074, 1076-1097, 1102, 1103, 1109-1121, 
1126-1132, 1134, 1135, 1137-1143, 1147-1155, 1159, 1161, 1164, 1165, 1167, 
1168, 1170, 1174, 1178-1185, 1189, 1191, 1192, 1194-1198, 1200, 1204, 
1206-1208, 1210, 1215, 1218-1223, 1225, 1227-1229, 1231-1236, 1240, 
1245-1247, 1253, 1254, 1258, 1260, 1263,1265, 1267, 1268, 1271, .1272, 
1277, 1278, 1280-1282, 1285, 1286, 1291-1293, 1296-1316, 1321-1326, 
1328-1345, 1348-1455, 1457, 1458, 1464-1490, 1496, 1497, 1511, 1513-1516, 
1518, 1519, 1523, 1525, 1526, 1528, 1529, 1533, 1535-1537, 1539-1542, 
1545-1552, 1560, 1561, 1567-1571, 1576, 1581, 1583, 1588-1591, 1593-1633, 
1635-1638, 1640-1642, 1644-1646, 1648-1654, 1656, 1661, 1662, 1673, 1674, 
1676, 1677, 1680, 1683, 1684, 1687, 1691, 1692, 1695, 1699, 1702, 1703, 
1707, 1714, 1718, 1719, 1727, 1728, 1730-1738, 1740-1744, 1746-1756, 
1758-1760, 1767, 1768, 1770, 1779, 1780, 1789, 1790, 1820, 1828, 1831, 
1833, 1836, 1839-1846, 1851-1859, 1861, 1863, 1865, 1869-1871, 1873-1877, 
1879, 1886-1890, 1892, 1896-1900, 1915, 1916, 1918, 1920, 1924, 1925, 
1927-1932, 1934, 1936-1950, 1952, 1953, 1956, 1961, 1966, 196B, 1969, 
1970, 1973-1980, 1983, 1984, 1987-1993, 1998, 2000-2004, 2006, 2007, 2011, 
2014, 2016-2029, 2034-2044, 2046, 2048-2056, 2061, 2063-2065, 2057, 2068, 
2072, 2075, 2085, 2086, 2091, 2095, 2096, 2106, 2108, 2109, 2111, 
2116-2118, 2120, 2122-2125, 2128, 2129, 2132, 2133, 2136-2143, 2146, 2147, 
2150, 2151, 2153, 2155, 2159, 2160, 2161, 2164, 2165, 2169, 2170 
2173-2176, 2179-2182, 2190-2192, 2199, 2200, 2203-2205, 2214, 2217-2222, 
2228, 2232, 2248, 2251, 2253, 2266, 2268-2271, 2280, 2283, 2291-2294, 2311, 
2313, 2324, 2327, 2328, 2339, 2340, 2349, 2350, 2355, 2358, 2362, 2370, 
2372, 2386, 2394, 2396, 2397, 2399, 2401, 2403, 2405-2407, 2410-2412, 
2415-2417, 2419-2421, 2423, 2424, 2426, 2442-2444, 2446, 2449, 2450, 2452, 
2454, 2459-2461, 2463, 2466-2468, 2473-2475, 2479, 2480, 2483, 2485, 2486, 
2491-2494, 2497-2500, 2505, 2506, 2512, 2520-2522, 2526, 2528-2530, 
2532-2535, 2539, 2540, 2543-2545, 2547, 2549-2568, 2571-2573, 2575, 
2576-2579, 2583, 2584, 2586-2589, 2591, 2597-2599, 2601-2603, 2605, 
2609-2612, 2614, 2615, 2617, 2618-2622, 2625-2627, 2630-2632, 2637-2640, 
2642-2647, 2649-2654 



<223> n = A,T,C or G 
<400> 2 

nnnnaagnnn nnaagngnnn nnggggagcc nggcnnnnnn agncgangaa ggangnnnnn 60 
nncnncnnna nncnnnggnn agnngnnnnn nnncnnnnna nccnnngnnc cgaaggggna 120 
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acccnnnnnn nnnnnrmnnn nnannnnnnn nnnnnnnnnn nnnimnnnnn ngnnnacnnn 180 
nngaangaaa cacnagannn nnaggaanag aaannaannn ngancnnnng agnggcgagc 240 
gaannngnan nagncnnnnn nnnnnnnnnn nnnnnnnnnn annngaannn nnggnaagnn 300 
nnnnnnnann nggnanannc cngannnnaa annnnnnnnn nnnnnnnnnn nnnnagannn 360 
cnnnncncgn gnnannnngn ngaannngnn nngaccannn nnnaagncaa aacnnnnnnn 420 
gaccnaagng nannagaong ganggaaagg ngaaaagnac ccnnnnnang ggaggaaana 480 
gnnccgaaac cnnnnncnan aannngnnna gnnnnnnnnn nnnnnganng cgnccgnann 540 
agnnncngng annnnnnnnn ngcnagnaan nnnnnnnngn agncgnagng aaancgagnn 600 
naanngngcg nnnagnnnnn gnnnnagacn cgaancnnng gancannnag nncaggngaa 660 
gnnnnngaan annnnnggag gnccgaacnn nnnnnnggaa aannnnnngg agannggnnn 720 
gnggngaaan ncnaancnaa cnnngnnaag cggccnncga aannnnaggn nnngcnnnnn 780 
nnnnnnnnnn nggnggagag cacgnnnnnn nnnnggnnnn nnnnnnnnna cnnannnnnn 840 
nnaaacnoga anccnnnnnn nnnnnnnnnn gnagnnannc nnngngngna annncnnngn 900 
nanagggnaa cancccagan cnncnnnaag gncccnaann nnnnnnaagg gnaaangang 960 
gnnnnnncnn anacannnag gangggcaga agcagccanc nnaaagahng cgaanagcca 1020 
cnncnagnnn nnnngcgcng annanancgg gncaannnnn nnnccgaann nnnngnnnnn 1080 
nnnnnnnnnn nnnnnnngga gnngagcgnn nnnnnnnnnn ngaagnnnnn nngnnannnn 1140 
nnnggannnn nnnnnaggng nagnngnnan agancgannn nnnnnggana nncnnnnncn 1200 
ccgnannncn aaggnccnnn nnnangnnnc nnnnnngggn agcgnnncca agnngagncn 1260 
ganangnnag nngaggnnan nnggnnaaoc nnnacnnnnn nnnnnnnnnn nnnnnngacg 1320 
nnnnnngnnn nnnnnnnnnn nnnnnggnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1380 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1440 
nnnnnnnnnn nnnnncnnga aaannnnnnn nnnnnnnnnn nnnnnnnnnn cgaccnnaaa 1500 
ccgacacagg ngnnnngnng agnanncnna ggngnnngnn nnaannnnnn nnaaggaacn 1560 
ngcaaannnn nccganccgg nanaaggnnn ncnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1620 
nnnnnnnnnn nnngnnnnan nnannngnnn nnnncnacga nnaaaaacac agnncnngcn 1680 
aanncgnaag nngangaang gnngacnccg cccnggcnng aaggaanngn nnnnnnnngn 1740 
nnnngnnnnn nnnnnnannn aagcccnngn aacggcggnn gaacaaacnn ccaaggagcg 1800 
aaaccgcggg aagccgaccn gcacgaangg ngnaangann nnnnnncgcc nnnnnnnnnc 1860 
ncngngaann nannnnngna agagcnnnnn cncgcnnnnn gacggaaaga ccccnngnan 1920 
oacnnannnn nnangnnnnn nnnnnnnnnn gnnagnaagg nggagncnnn gannnnnnnn 1980 
cgnnagnnnn nnnggagncn nnnngnnaac nacncnnnnn nnnnnnnnnc aacnnnnnnn 2040 
nnnnancnnn nnnnnngaca ngnnngnngg gnagnacggg gcggnncccc naaanngaac 2100 
ggaggngnnc naaggnnncn annnnggnng gnnacnnnnn nnnagnnaan ngnanaagnn 2160 
ngcnnacgnn agnnnnacnn nncgagcagn nncgaaagnn ggnnnaggac cggnggnnnn 2220 
nnggaagngc cncgccaacg gaaaaagnac ncnggggaaa caggcnannn ncccaagagn 2280 
canacgacgg nnnngggcac ccgagcggcc ncncacoggg gcgnagnngg cccaagggnn 2340 
ggcgcgccnn aaagnggnac gngagcgggn anaacgcgga gacagnggcc cacngnngng 2400 
ngngnnngan nngannngnn ngnncnagac gagaggaccg gnnngnacnn ancncgggnn 2460 
ncnggnnngc cannngcann gcngnnagca nnnnggnnnn gaaanngcga angcacaagn 2520 
nngaancnnn cnnnnagann agnnncncnn nnnnnnnnnn nnnnnnnnag nnncnnnnna 2580 
gannannnng ngaaggnnng nnngnaagnn nngnnannnn nnagnnnacn nnacaannnn 2640 
cnnnnnncnn nnnn 2654 

<210> 3 
<211> 13 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 3 

cgtggtgacc ctt 13 

<210> 4 
<211> 14 



4 



wo 2004/060278 



PCT/US2003/038761 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 4 

cgtcgtcacc gota 14 

<210> 5 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 5 

cgtggtaccc ctt 13 

<210> 6 
<211> 90 
<212> RNA 

<213> Bacillus anthracis 
<220> 

<221> misc_feature 
<222> 20 

<223> n = A, U, C or G 
<400> 6 

gcgaagaacc uuaccaggun uugacauccu cugacaaccc uagagauagg gcuucuccuu 60 
cgggagcaga gugacaggug gugcaugguu 90 

<210> 7 
<2H> 90 
<212> RNA 

<213> Bacillus cereus 
<400> 7 

gcgaagaacc uuaccagguc uugacauccu cugaaaaccc uagagauagg gcuucuccuu 60 
cgggagcaga gugacaggug gugcaugguu 90 

<210> 8 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PGR Primer 
<400> 8 

gtgagatgtt gggttaagtc ccgtaacgag 

<210> 9 
<211> 21 



30 
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<212> DKA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 9 

gacgtcatcc ccaccttcct c 

<210> 10 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> .10 

atgttgggtt aagtcccgca acgag 

<210>. 11 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 11 

ttgacgtcat ccccaccttc etc 

<210> 12 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 12 

ttaagtcccg caacgatcgc aa 

<210> 13 
<211> 22 
<212> DHA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 13 

tgacgtcatc cccaccttcc tc 

<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> PGR Primer 



<400> 14 

gctacacacg tgctacaatg 



20 



<210> 15 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 15 

cgagttgcag actgcgatcc g 21 

<210> 16 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 



<210> 17 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 17 

gacgggcggt gtgtacaag 19 

<210> 18 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 



<210>.19 
<21I> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 



<400> 16 

aagtcggaat cgctagtaat eg 



22 



<400> 18 

tgaacgctgg tggcatgctt aacac 



25 
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<223> PCR Primer 
<400> 19 

tacgcattac tcacccgtcc gc 22 

<210> 20 
<21L> 26 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 20 

gtggcatgcc taatacatgc aagtcg 26 

<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 22 

taacacatgc aagtcgaacg 20 

<210> 23 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<210> 24 
<211> 18 
.<212> DNA 
<213> Artificial 

<220> 

<223> PCR Primer 



<400> 21 

ttactcaccc gtccgccgct 



20 



<400> 23 

ttactcaccc gtccgcc 



17 
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<400> 24 

gtgtagcggt gaaatgcg 



18 



<210> 25 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 25 

gtatctaatc ctgtttgctc cc 22 

<210> 26 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 



<210> 28 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 28 

ggattagaga ccctggtagt cc 22 

<210> 29 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 



<400> 26 

agaacaccga tggcgaaggc 



20 



<210> 27 
<211> 21 

<212> DNA » 

<213> Artificial Sequence 



<220> 

<223> PGR Primer 



<400> 27 

cgtggactac cagggtatct a 



21 



<400> 29 

ggccgtactc cccaggcg 



18 
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<210> 30 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 30 

ggattagata ccctggtagt ccacgc 26 

<210> 31 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<210> 32 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 32 

tagataccct ggtagtccac gc 22 

<210> 33 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<210> 34 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer . .... 

<400> 34 

ttcgatgcaa cgcgaagaac ct 22 



<400> 31 

ggccgtactc ccoaggcg 



18 



<400> 33 
cgtactcccc aggcg 



15 



<210> 35 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 35 

acgagctgac gacagccatg 

<210> 36 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 36 

acgcgaagaa ccttacc 

<210> 37 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<4 00> 37 

acgacacgag ctgacgac 

<210> 38 
<211> 18 
<212> DNA 
<213> Artificial 



<220> 

<223> PGR Primer 
<400> 38 

ctgacacctg cccggtgc 

<210> 39 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<400> 39 

gaccgttata gttacggcc 

<210> 40 
<211> 25 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<223> PGR Primer 
<400> 40 

tctgtcccta gtacgagagg accgg 

<210> 41 
<211> 18 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 41 

tgcttagatg ctttcagc 

<210> 42 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 42 

ctgtccctag tacgagagga ccgg 

<210> 43 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 43 

gtttcatgct tagatgcttt cage 

<210> 44 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



<400> 44 

ggggagtgaa agagatcctg aaaccg 

<210> 45 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> PGR Primer 
<400> 45 

acaaaaggta cgccgtcacc c 21 

<210> 46 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 4 6 

ggggagtgaa agagatcctg aaaccg 26 

<210> 47 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 47 

acaaaaggca cgccatcacc c 21 

<210> 48 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 48 

cgagagggaa acaacccaga cc 22 

<210> 49 
<2H> 19 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PGR Primer 
<400> 49 

tggctgcttc taagccaac 19 
<210> 50 

<211>. 29 . 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 



13 



wo 2004/060278 



PCT/US2003/038761 



<400> 50 

tgctcgtggt gcacaagtaa cggatatta 



29 



<210> 51 
<211> 29 
<212> DMA 

<213> Artificial Sequence 

<220> 

<223> PGR Primer 
<400> 51 

tgctgctttc goatggttaa ttgcttcaa 29 

<210> 52 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 



<210> 53 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 53 

tcaagcgcca tttcttttgg taaaccacat 30 

<210> 54 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 



<210> 55 
<2H> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 



<400> 52 

caaaacttat taggtaagcg tgttgact 



28 



<400> 54 

caaaacttat taggtaagcg tgttgact 



28 



<400> 55 
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tcaagogcca tctctttcgg taatccacat 



30 



<210> 56 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 56 

taagaagccg gaaaccatca actaccg 27 

<210> 57 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<210> 58 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 58 

tgattctggt gcccgtggt 19 

<210> 59 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<210> 60 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<22Q> ... 
<223> PGR Primer 

<400> 60 

tgattccggt gcccgtggt 19 



<400> 57 



ggcgcttgta cttaccgcac 



20 



<400> 59 

ttggccatca ggccacgcat ac 



22 
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<210> 61 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR Primer 



<400> 61 

ttggccatca gaccacgcat ac 

<210> 62 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR Primer 



<400> 62 

ctggcaggta tgcgtggtct gatg 

<210> 63 
<211> 25 
<212> DMA 

<213> Artificial Sequence 



<220> 

<223> PCR Primer 



<400> 63 

cgcaccgtgg gttgagatga agtac 

<210> 64 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR Primer 



<400> 54 

cttgctggta tgcgtggtct gatg 

<210> 65 
<211> 25 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR Primer 
<400> 65 

cgcaccatgc gtagagatga agtac 



<210> 66 
<211> 26 
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<212> DNA 

<213> iVrtificial Sequence 
<220> 

<223> PGR Primer 
<400> 66 

cgtcgggtga ttaaccgtaa caaccg 

<210> 67 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 67 

gtttttcgtt gcgtacgatg atgtc 

<210> 68 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 68 

cgtcgtgtaa ttaaccgtaa caaccg 

<210> 69 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<400> 69 

acgtttttcg ttttgaacga taatgct 

<210> 70 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 70 

caaaggtaag caaggtcgtt tccgtca 

<210> 71 
<2H> 24 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> PGR Primer 
<400> 71 

cgaacggcct gagtagtcaa cacg 

<210> 72 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 72 

caaaggtaag caaggacgtt tccgtca 

<210> 73 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 73 

cgaacggcca gagtagtcaa cacg 

<210> 74 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 74 

tagactgccc aggacacgct g ' 

<210> 75 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 75 

gccgtccatc tgagcagcac c 

<210> 76 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> PGR Primer 



<400> 76 

ttgactgccc aggtcacgct g 



21 



<210> 77 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 77 

gccgtccatt tgagcagcac c 21 

<210> 78 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<210> 79 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCK Primer 
<400> 79 

gttgtcgcca ggcataacca tttc 24 

<210> 80 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<210> 81 
<2H> 24 
<2I2> DNA . 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<400> 78 



aactaccgtc cgcagttcta cttcc 



25 



<400> 80 

aactaccgtc ctcagttcta cttcc 



25 
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<400> 81 

gttgtcacca ggcattacca tttc 



24 



<210> 82 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 82 

ccacagttct acttccgtac tactgacg 28 

<210> 83 
<211> 30 
<212> DHA 

<2I3> Artificial Sequence 
<220> 

<223> PCR Primer 



<210> 84 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 84 

gacctacagt aagaggttct gtaatgaacc 30 

<210> 85 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<210> 86 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 86 

catccacacg gtggtggtga agg 23 



<400> 83 

tccaggcatt accatttcta ctccttctgg 



30 



<400> 85 

tccaagtgct ggtttaccco atgg 



24 



20 



wo 2004/060278 



PCT/US2003/038761 



<210> 87 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<2235 PCR Primer 
<400> 87 

gtgctggttt accccatgga gt 

<210> 88 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 88 

cgtgttgact attcggggcg ttcag 

<210> 89 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 8? 

attcaagagc catttctttt ggtaaaccac 

<210> 90 
<211> 29 
<212> DNA 

<213> Artificial Sequence 



<400> 90 

tcaacaacct cttggaggta aagctcagt 

<210> 91 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 91 

tttcttgaag agtatgagct gctccgtaag 
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<2H> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 92 

catccacacg gtggtggtga agg 

<210> 93 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 93 

tgttttgtat ccaagtgctg gtttacccc 

<210> 94 
<211> 20 
<212> DNA 



23 



29 



<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 94 

cgtggcggcg tggttatcga 20 

<210> 95 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 95 

cggtacgaac tggatgtcgc cgtt 24 

<210> 96 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

.<40Q> 96 . . 

tatcgctcag gcgaactcca ac 22 



<210> 97 
<211> 21 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> PGR Primer 



<400> 97 

gctggattcg cctttgctac g 



21 



<210> 98 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PGR Primer 
<400> 98 

tgtaatgaac cctaatgacc atccacacgg 30 

<210> 99 
<211> 27 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 



<210> 100 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 100 

taatgaaccc taatgaccat ccacacggtg 30 

<210> 101 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 101 

tccaagtgct ggtttacccc atggag 26 

<210> 102' ■ ■ 

<211> 29 
<212> DNA 

<213> Artificial Sequence 



<400> 99 

ccaagtgctg gtttacccca tggagta 



27 
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<220> 

<223> PGR Primer 
<400> 102 

cttggaggta agtctcattt tggtgggca 

<210> 103 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 103 

cgtataagct gcaccataag cttgtaatgc 

<210> 104 
<211> 18 
<212> DNA 



29 



30 



<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 104 

cgacgcgctg cgcttcac 18 

<210> 105 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 105 

gcgttccaca gcttgttgca gaag 24 

<210> 106 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PGR Primer 
<400> 106 

gaccacctcg gcaaccgt 18 
<210> 107 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
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<«00> 107 

ttcgctctcg gcctggcc 

<210> 108 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<400> 108 

gcactatgca cacgtagatt gtcctgg 

<210> 109 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 109 

tatagcacca tccatctgag cggcac 

<210> 110 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 110 

cggcgtactt caacgacagc ca 

<210> 111 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 111 

cgcggtcggc tcgttgatga 

<210> 112 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<400> 112 
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cttctgcaac aagctgtgga acgc 

<210> 113 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 113 . 

tcgcagttca tcagcacgaa gcg 

<210> 114 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 114 

aagacgacct gcacgggc 

<210> 115 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 115 

gcgctccacg tcttcacgc 

<210> 116 
<2H> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 116 

ctgttcttag tacgagagga cc 

<210> 117 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<220> . . 
<223> PCR Primer 

<400> 117 

ttcgtgctta gatgctttca g 
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<210> 118 

<:211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

<400> 118 

acgcgaagaa ccttacc 

<210> 119 

<211> 18 

■ <212> DNA 

<213> Artificial Sequence 

<220> 

<223> PCR Primer 

<400> 119 

acgacacgag ctgacgac 

<210> 120 

<211> 14 

<212> DNA 

<213> Artificial Sequence 
<220> . 

<223> PCR Primer 

<400> 120 
cgaagaacct tacc 

<210> 121 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

<400> 121 
acacgagctg ac 

<210> 122 

<211> 14 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

<400> 122 
cgaagaacct tacc 



<210> 123 
<211> 12 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 123 

acacgagctg ac 12 

<210> 124 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 124 

cc'tgataagg gtgaggtcg 19 

<210> 125 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 125 

acgtccttca tcgcctctga 20 

<210> 126 
<2H> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 126 

gttgtgaggt taagcgacta ag 22 

<210> 127 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 127 

ctatcggtca gtcaggagta t 21 

<210> 128 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> PGR Primer 
<400> 128 

gttgtgaggt taagcgacta ag 

<210> 129 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<400> 129 

ttgcatcggg ttggtaagtc 

<210> 130 
<211> 21 
<212?- DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 130 

atactcctga ctgaccgata g 

<210> 131 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PGR Primer 
<400> 131 

aacatagcct tctccgtcc 

<210> 132 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 

<400> 132 

gacttaccaa cccgatgcaa 

<210> 133 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> PCR Primer 

<400> 133 

taccttagga ccgttatagt tacg 

<210> 134 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 134 

ggacggagaa ggctatgtt 

<210> 135 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 135 

ccaaacaccg ccgtcgatat 

<210> 136 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer • 
<400> 136 

cgtaactata acggtoctaa ggta 

<210> 137 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 137 

gcttacacac ccggcctatc 

<210> 138 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
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<400> 138 

atatcgacgg cggtgtttgg 

<210> 139 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 139 

gcgtgacagg caggtattc 

<210> 140 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<400> 140 

agtctcaaga gtgaacacgt aa 

<210> 141 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 141 

gctgctggca cggagtta 

<210> 142 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 142 

gacacggtcc agactcctac 

<210> 143 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

■ <220> 
<223> PGR Primer 

<400> 143 

ccatgcagca cctgtctc 
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<210> 144 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<400> 144 

gatctggagg aataccggtg 

<210> 145 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 145 

acggttacct tgttacgact 

<210> 146 
<211> 20 
<212>,DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 146 

gagagcaagc ggacctcata 

<210> 147 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 147 

cctcctgcgt gcaaagc 

<210> 148 
<211> 19 
<212> DHA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer. . 
<400> 148 

tggaagatct gggtcaggc 
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<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 149 

caatctgctg acggatctga gc 

<210> 150 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 150 

gtcgtgaaaa cgagctggaa ga 

<210> 151 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 151 

catgatggtc acaaccgg 

<210> 152 
<211> 22 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 152 

tggcgaacct ggtgaacgaa gc 

<210> 153 
<211> 25 
<212> OKA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 

<400> 153 

ctttcgcttt ctcgaactcai accat' 

<210> 154 
<211> 26 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<223> PGR Primer 
<400> 154 

cgtcagggta aattccgtga agttaa 

<210> 155 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<400> 155 

aacttcgcct tcggtcatgt t 

<210> 156 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 156 

ggtgaaagaa gttgcctcta aagc 

<210> 157 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 157 

ttcaggtcca tcgggttcat gcc 

<210> 158 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<400> 158 

cgtggcggcg tggttatcga 

<210> 159 
<2H> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> PCR Primer 
<400> 159 

acgaactgga tgtcgccgtt 

<210> 160 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 160 

cggaattact gggcgtaaag 

<210> 161 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 161 

cgcatttcac cgctacac 

<210> 162 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 162 

acccagtgct gctgaaccgt gc 

<210> 163 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 163 

gttcaaatgc ctggataccc a 

<210> 164 
<211> 22 . 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
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<400> 164 

gggagcaaac aggattagat ac 

<210> 165 
<211> 15 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 

<400> 165 
cgtactcccc aggcg 

<210> 166 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 166 

tggcccgaaa gaagctgagc g 

<210> 167 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 167 

acgcgggcat gcagagatgc c 

<210> 168 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 168 

atgttgggtt aagtcccgc 

<210> 169 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
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tgacgtcatc cccaccttcc 

<210> 170 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 170 

cttgtacaca ccgcccgtc 

<210> 171 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 171 

aaggaggtga tccagcc 

<210> 172 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<400> 172 

cggattggag tctgcaactc g 

<210> 173 
<2H> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 173 

gacgggcggt gtgtacaag 

<210> 174 
<211> 15 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PGR Primer 

<400> 174 
ggtggatgcc ttggc 
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<210> 175 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

<400> 175 
gggtttcccc attcgg 

<210> 176 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 176 

gggaactgaa acatctaagt a 

<210> 177 
<211> 15 
<212> DNA 

<213> Artificial Sequence 



<400> 177 
ttcgctcgcc gctac 

<210> 178 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 178 

taccccaaac cgacacagg 

<210> 179 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 179 

ccttctcccg aagttacg 

<210> 180 
<211> 19 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

<400> 180 

ccgtaacttc gggagaagg 

<210> 181 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

<400> 181 
caccgggcag gcgtc 

<210> 182 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

<400> 182 
gacgcctgcc cggtgc 

<210> 183 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 183 

ccgacaagga atttcgctac c 

<210> 184 
<2H> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 184 

aaggtactcc ggggataaca. ggc 

<210> 185 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
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<400> 185 

agccgacatc gaggtgccaa ac 

<210> 186 
<211> 18 
<212> DMA 

<213> Artificial Sequence 



<400> 186 

gacagttcgg tccc'tatc 

<210> 187 
<2H> 17 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 187 

ccggtcctct cgtacta 

<210> 188 
<211> 17 
<212> DMA 

<213> Artificial Sequence 

<220> 

<223> PCR Primer 
<400> 188 

tagtacgaga ggaccgg 

<210> 189 
<211> 22 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 189 

ttagatgctt tcagcactta tc 

.<21Q> 190 . ... 
<211> 24 
<212> DMA 

<213> Artificial Sequence 



<220> 
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<223> ECR Primer 
<400> 190 

aaactagata acagtagaca tcac 

<210> 191 
<211> 18 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 191 

gtgcgccctt tctaactt 

<210> 192 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 192 

agagtttgat catggctcag 

<210> 193 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

<400> 193 
actgctgcct cccgtag 

<210> 194 
<2H> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 194 

cactggaact gagacacgg 

<210> 195 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
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<400> 195 

ctttacgccc agtaattccg 

<210> 196 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<400> 196 

ccagcagccg cggtaatac 

<210> 197 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 197 

gtatctaatc ctgtttgctc cc 

<210> 198 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 198 

gtgtagcggt gaaatgcg 

<210> 199 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PGR Primer 
<400> 199 

ggtaaggttc ttcgcgttg 

<210> 200 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<2205> 

<223> PGR Primer 



<400> 200 

aagcggtgga gcatgtgg 



42 



wo 2004/060278 PCTAJS2003/038761 



<210> 201 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 201 

attgtagcac gtgtgtagcc c 

<210> 202 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 202 

caagtcatca tggccctta 

<210> 203 
<211> 17 
<212> DNA 

<213> Artificial Sequence 



<400> 203 

aaggaggtga tccagcc 

<210> 204 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 204 

agagtttgat catggctcag 

<210> 205 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 205 

aaggaggtga tccagcc 



<210> 206 



wo 2004/060278 



PCT/US2003/038761 



<211> 19 
<212> DNA 

<213> Artificial Sequence 



<400> 206 

acctgcccag tgctggaag 

<210> 207 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 207 

tcgctaoctt aggaccgt 

<210> 208 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 208 

gccttgtaca cacctcccgt c 

<210> 209 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 209 

cacggctacc ttgttacgac 

<210> 210 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

<4Q0> 210 

ttgtacacac cgcccgtcat ac 



<210> 211 
<2H> 20 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<223> PGR Primer 
<400> 211 

ccttgttacg acttcacccc 

<210> 212 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 212 

tacggtgaat acgttcccgg g 

<210> 213 • 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 213 

accttgttac gacttcaccc ca 

<210> 214 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 214 

accacgccgt aaacgatga 

<210> 215 
<2H> 21 
<212> DNA 

<213> Artificial Sequence 



<400> 215 

cccccgtcaa ttcctttgag t 

<210> 216 ■ 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> PCR Primer 
<400> 216 

gataccctgg tagtccacac eg 

<210> 217 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 217 

gccttgcgac cgtactccc 

<210> 218 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 218 

tagataccct ggtagtccac go 

<210> 219 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 219 

gcgaccgtac tccccagg 

<210> 220 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PCR Primer 
<400> 220 

tagtcccgca acgagcgc 

<210> 221 
<211> 22 
■<212> DNA 

<213> Artificial Sequence 
<220> 

<223>'PCR Primer 
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<400> 221 

gacgtcatcc ccaccttcct cc 

<210> 222 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 222 

tagaacgtcg egagacagtt eg 

<210> 223 
<211> 20 
<212> DHA 

<213> Artificial Sequence 



<400> 223 

agtccatccc ggtcctctcg 

<210> 224 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 224 

gaggaaagtc cgggctc 

<210> 225 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 225 

ataagccggg ttctgtcg 

<210> 226 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
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gaggaaagtc catgctcgc 

<210> 227 
<211> 22 
<212> DKA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 227 

gtaagccatg ttttgttcca tc 

<210> 228 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 228 

gaggaaagtc cgggctc 

<210> 229 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 229 

ataagccggg ttotgtcg 

<210> 230 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 230 

gcgggatcct ctagaggtgt taaatagcct ggcag 

<210> 231 
<211> 35 
<212> DNA 

<213> Artificial Sequence 



<400> 231 

gcgggatcct ctagaagaoc tcctgcgtgc aaagc 
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<210> 232 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 232 

gaggaaagtc catgctcac 

<210> 233 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 233 

ataagccatg ttctgttcca tc 

<210> 234 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<400> 234 

atgttgggtt aagtcccgc 

<210> 235 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 235 

aaggaggtga tccagcc 

<210> 236 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<400> 236 

cggaattact gggcgtaaag 



<210> 237 
<211> 22 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 237 

gtatctaatc ctgtttgctc cc 22 

<210> 238 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 238 

atgttgggtt aagtcccgc 19 

<210> 239 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 239 

tgacgtcatg cccaccttcc 20 

<210> 240 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

<400> 240 

atgttgggtt aagtcccgc _ 19 

<210> 241 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 241 

tgacgtcatg gccaccttcc .. 20 

<210> 242 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
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<400> 242 

gcgggatcct ctagacctga taagggtgag gtcg 

<210> 243 
<211> 34 
<212> DNA 

<213> Artificial Sequence 



<400> 243 

gcgggatcct ctagagcgtg acaggcaggt atto 

<210> 244 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 244 

acgcgaagaa ccttacc 

<210> 245 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 245 

gacgggcggt gtgtacaag 

<210> 246 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer. 

<400> 246 

gtgtagcggt gaaatgcg 

<210> 247 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> PGR Primer 
<400> 247 

cgagttgcag actgcgatcc g 

<210> 248 
<211> 20 
<212> Dm 

<213> Artificial Sequence 
<220> 

<223> ECR Primer 
<400> 248 

taacacatgc aagtcgaacg 

<210> 249 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

<400> 249 
cgtactcccc aggcg 

<210> 250 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 250 

taacacatgc aagtcgaacg 

<210> 251 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 251 

acgacacgag ctgacgac 

<210> 252 
<211> 22 
<212> DNA . 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
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<400> 252 

acaacgaagt acaatacaag ac 

<210> 253 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<400> 253 

cttctacatt tttagccatc ac 

<210> 254 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 254 

ttaagtcccg caacgagcgc aa 

<210> 255 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 255 

tgacgtcatc cccaccttcc tc 

<210> 256 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PCR Primer 
<400> 256 

tgagtgatga aggccttagg gttgtaaa 

<210> 257 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 257 

cggctgctgg cacgaagtta g 
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<210> 258 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 258 

atggaoaagg ttggcaagga agg 

<210> 259 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
•<220> 

<223> PGR Primer 
<40O> 259 

tagccgcggt cgaattgcat 

<210> 260 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 260 

aaggaaggcg tgatcaccgt tgaaga 

<210> 261 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> FOR Primer 
<400> 261 

ccgcggtcga attgcatgcc ttc 

<210> 262 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 

<400> 262 
acgcgctgcg cttcac 

<210> 263 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 263 

ttgcagaagt tgcggtagcc 

<210> 264 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<400> 264 

tcgaccacct gggcaacc 

<210> 265 
<211> 19 
<212? DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 265 

atcaggtcgt gcggcatca 

<210> 266 
<211> 17 
<212> DNA 

<213> Artificial Sequence 



<400> 266 

cacggtgccg gcgtact 

<210> 267 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

<4.00> 267 

gcggtcggct cgttgatgat 

<210> 268 
<211> 25 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<223> PCR Primer 
<400> 268 

ttggaggtaa gtctcatttt ggtgg 

<210> 269 
<211>.25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 269 

aagctgcacc ataagcttgt aatgc 

<210> 270 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 270 

cagcgtttcg gcgaaatgga 

<210> 271 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 271 

cgacttgacg gttaacattt cctg 

<210> 272 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 272 

gggcagcgtt tcggcgaaat gga 

<210> 273 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> PCR Primer 
<400> 273 

gtccgacttg acggtcaaca tttcctg 

<210> 274 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 274 

caggagtcgt tcaactogat ctacatgat 

<210> 275 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 275 

acgccatcag gccacgcat 

<210> 276 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 276 

gcacaacctg cggctgcg 

<210> 277 
<211> 18 
<212> DNA 

<213> Artificial Se(iuence 



<400> 277 

acggcacgag gtagtcgc 

<210> 278 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR Primer 
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tgttactcac ccgtctgcca ct 

<210> 284 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 284 

accgagcaag gagaccagc 

<210> 285 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 285 

tataacgcac atcgtcaggg tga 

<210> 285 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<400> 286 

agacccaatt acattggctt 

<210> 287 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 287 

ccagtgctgt tgtagtacat 

<210> 288 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> ... 
<223> PCR Primer 

<400> 288 

atgtactaca acagtactgg 
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<210> 289 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 289 

caagtcaacc acagcattca 

<210> 290 
<211> 20 
<212> Dm 

<213> Artificial Sequence 



<400> 290 

gggcttatgt actacaacag 

<210> 291 
<211> 21 
<212> DNA 

<213> Artificial Sequence ■ 
<220> 

<223> PGR Primer 
<400> 291 

tctgtcttgc aagtcaacca c 

<210> 292 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 292 

ggaatttttt gatggtagag a 

<210> 293 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 293 

taaagcacaa tttcaggcg 



<210> 294 
<211> 20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 

<400> 294 

tagatctggc tttctttgac 

<210> 295 
<211> 21 
<212> DNA 
<213> Artificial 



<220> 

<223> PCR Primer 
<400> 295 

atatgagtat ctggagtctg c 

<210> 296 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<4O0> 296 

ggaaagacat tactgcagac a 

<210> 297 
<2H> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 297 

ccaacttgag gctctggctg 

<210> 298 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 298 

acagacactt accagggtg 

<210> 299 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
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<220> 

<223> PCR Primer 
<400> 299 

actgtggtgt catctttgtc 

<210> 300 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<4 00> 300 

tcactaaaga caaaggtctt cc 

<210> 301 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 301 

ggcttcgccg tctgtaattt c 

<210> 302 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<400> 302 

cggatccaag ctaatctttg g 

<210> 303 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 303 

ggtatgtact cataggtgtt ggtg 

<210> 304 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> PCR Primer 
<400> 304 

agacccaatt acattggctt 

<210> 305 
<211> 20 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 305 

ccagtgctgt tgtagtacat 

<210> 306 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<400> 306 

atgtactaca acagtactgg 

<210> 307 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 307 

caagtcaacc acagcattca 

<210> 308 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 308 

gggcttatgt actacaacag 

<210> 309 
<211> 21 
<212>. DNA . . 

■<213> Artificial Sequence 
<220> 

<223> PCR Primer 
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<400> 309 

tctgtcttgc aagtcaacca c 

<210> 310 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 310 

ggaatttttt gatggtagag a 

<210> 311 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 311 

taaagcacaa tttcaggcg 

<210> 312 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<40O> 312 

tagatctggc tttctttgac 

<210> 313 
<211> 21 
<2I2> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<40O> 313 

atatgagtat ctggagtctg c 

<210> 314 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<40Q> 314 

cggatccaag ctaatctttg g 
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<210> 315 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<400> 315 

ggtatgtact cataggtgtt ggtg 

<210> 316 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<400> 316 

aacagaccca attacattgg ctt 

<210> 317 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 317 

gaggcacttg tatgtggaaa gg 

<210> 318 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<400> 318 

atgcctaaca gacccaatta cat 

<210> 319 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer - 
<400> 319 

ttcatgtagt cgtaggtgtt gg 



<210> 320 
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<220> 

<223> PGR Primer 
<400> 330 

ggtcgttatg tgcctttcca cat 23 

<210> 331 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 331 

tcctttctga agttccactc atagg 25 

<210> 332 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 332 

acaacattgg ctaccagggc tt 22 

<210> 333 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 333 

cctgcctgct cataggctgg aagtt 25 

<210> 334 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PGR Primer i 
<400> 334 

ggattagaga ccctggtagt cc 22 
<210> 335 

<211> 18 . ... 

<212> -DNA •• 

<213> Artificial Sequence 

<220> 

<223> PGR Primer 
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<400> 335 

ggccgtactc cccaggcg 

<210> 336 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 336 

ttcgatgcaa cgcgaagaac ct 

<210> 337 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 337 

acgagctgac gacagccatg 

<210> 338 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

.<223> PCR Primer 
<400> 338 

tctgtcccta gtacgagagg accgg 

<210> 339 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 339 

tgcttagatg ctttcagc 

<210> 340 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
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c1:ggcaggta tgcgtggtct gatg 

<210> 341 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 341 

cgcaccgtgg gttgagatga agtac 

<210> 342 
<211> 30 
<212> DNft 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 342 

ggggattcag ccatcaaagc agctattgac 

<210> 343 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220S> 

<223> PCR Primer 
<400> 343 

ccaacctttt ccaoaacaga atcagc 

<210> 344 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 344 

ccttacttcg aactatgaat cttttggaag 

<210> 345 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PCR Primer 



<400> 345 

cccatttttt cacgcatgct gaaaatatc 



29 



wo 2004/060278 PCT/US2003/038761 



<210> 346 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 

<400> 346 

cgcaaaaaaa tccagctatt ago 

<210> 347 
<2n> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 347 

aaactatttt tttagctata ctcgaacac 

<210> 348 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 348 

atgattacaa ttcaagaagg tcgtcacgc 

<210> 349 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 349 

ttggacctgt aatcagctga atactgg 

<210> 350 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 35D 

gatgactttt tagctaatgg tcaggcagc 



<210> 351 
<211> 29 



wo 2004/060278 



PCT/US2003/038761 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 351 

aatcgacgac catcttggaa agatttctc 

<210> 352 
<2H> 25 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 352 

gcttcaggaa tcaatgatgg agcag 25 

<210> 353 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 353 

gggtctacac ctgcacttgc ataac 25 

<210> 354 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 354 

gtactgaatc cgcctaag 18 

<210> 355 • 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 355 

gtgaataaag tatcgcccta ata . . . . 23 

<210> 356 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> PGR Primer 
<400> 356 

gaagttgaac cgggatca 

<210> 357 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<400> 357 

attatcggtc gttgttaatg t 

<210> 358 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 358 

ctgtctgtag ataaactagg att 

<210> 359 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 359 

cgttcttctc tggaggat 

<210> 360 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 

<400> 360 
cgatactacg gacgc 

<210.> 361 
<211>-22 
<212> DNA 

<213> Artificial Sequence 
<220> 



73 



wo 2004/060278 



PCT/US2003/038761 



<210> 372 
<211> 15 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PGR Primer 

<400> 372 
cgatactacg gacgc 

<210> 373 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 373 

ctttatgaat tactttacat at 

<210> 374 
<211> IB 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 374 

ctcctccatc actaggaa 

<210> 375 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<4D0> 375 

ctataacatt caaagcttat tg 

<210> 376 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PGR Primer 



<400> 376 

cgcgataata gatagtgcta aac 
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<2il> 19 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> PCR Priit^lj^ 
<400> 377 

gcttccacca ggtcattaa 
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